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advertised capability to operate in parallel with the main processor (CPU), each one of 
them at rate of 10 mbit’/sec in each direction. It also presents to the reader an 
introduction to the machine itself, to the Occam Programming Language, a description 
of the environment at the Naval Postgraduate School{ NPS), and suggests to the novice 
a learning sequence. 

The evaluation programs and other example programs presented in this thesis 
were implemented using the Occam Programming Language (Proto-Occam) in either 
the Occam Programming System (OPS) or the Transputer Development System (TDS), 
both resident on the VAX 11/780 computer under the VMS Operating System 
(VAX/VMS). 
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[. INTRODUCTION 


A. BACKGROUND 

The NPS AEGIS project has in its primary goals the research and development 
of alternative architectures for the AEGIS Combat Weapon System (CWS), focusing 
on low cost, reliable and fault tolerant architectures. As the cost of micro-processors 
has been decreasing incredibly and the capabilities are always increasing, it turns out to 
be very attractive to think of using these cheap and powerful tools to accomplish the 
functions of any system. 

One branch of this research is based upon the Intel 86/12A Single Board 
Computers that are working under the MCORTEX operating system [Ref. 1], fully 
developed at the AEGIS lab. It exploits the 10mbits/second capacity of the Intel 
MULTIBUS and uses the concept of shared memory to allow multiprocessors arranged 
in clusters of up to eight single board computers, to increase the throughput of the 
system. Each cluster has its own shared memory whose access is controlled by means 
of eventcounts. The clusters intercommunicate through an Ethernet link [Ref. 2]. 

One alternative concept for distributed systems is the use of message passing 
[Ref: 2,3]. The Transputer concept exploits this idea and produces a very interesting 
and flexible way of designing multiprocessor systems. This second branch of research is 
now in its third released work! and is increasing in importance and extent. 

This thesis was developed in parallel and concurrently with the one from 
Cordeiro. M. M. [Ref. 6]. Since these theses were in fact the first to really program 
this new machine, at the NPS, a series of obstacles were encountered and overcome 
one by one, up to the point we were able to divide the work, and on our own, search 
for the answers we were individually seeking. This is the reason why we tried to point 
out manv of the pitfalls that one may encounter in future research in this area using 


the system available at the NPS. 


‘See B. Evin Implementation of A Serial Delay Insertion Type Loop 
Communication for a Real Time Multi-Transputer System [Ref. 4] and Selcuk, Z., 
Implementation of a Serial Communication Process for a Fault Tolerant, Real Time, 
Multi-Transputer Operating System [Ref. 5]. 
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1. Intended Audience 

This will be a good first reading for the person beginning to work with 
transputers or Occam. Appendix A presents a Learning Sequence; Appendix B presents 
an OPS Tutorial; Appendix C presents a TDS Tutorial and Appendix D provides some 
hints on how to program in Occam. I[t also will be a good reference for transputer 
users and real-time svstem designers and implementors in the sense of what they can 
expect in terms of performance from the INMOS links. They will be able use the 
Tables, Graphics and the Evaluation Programs to check and confirm their expectations 
In issues Concerning what should be the right construct or the right message size to use, 


in order to achieve the desired throughput or communication rate. 


B. WHY THE TRANSPUTER 

The TRANSPUTER is a member of a family of micro-processors, that have on 
one chip, the processor, its own local memory and links for point to point connections 
to other transputers. 

Each transputer product contains special circuitry and interfaces adapting it to 
each particular use. For example a peripheral control transputer, such as a graphics or 
disk controller, has interfaces tailored to the requirements of a specific device [Ref. 7]. 

The transputers were designed in parallel with the Occam programming language 
and were first released in 1985. Now, two years later, there is a growing variety of 
transputers available on the market with different capabilities and for different 
applications. Some of these are listed in Appendix G. 

The T-414 is a 32 bit micro-processor with 2 kbytes of on chip RAM, four 
standard iNMOS serial links, external memory interface and peripheral interfacing on a 
single 1.5 micron CMOS chip. As an example, its characteristics and technical data are 
summarized in Table I, and its internal architecture is depicted in Figure Le 

For the sake of comparison, Table 2 lists the processor cycle time or internal 
clock of other commercially available computers and also some processors used in 
military applications for real-time. 

The internal architecture of the transputer follows Von Newman principles and 
permits the processor itself to run at the same time as the 4 links operate. This way a 


high level of parallelism is achieved already on chip level. 


“Reproduced by permission of INMOS Corporation. 


TABEE 1 
TRANSPUTER T-414 TECHNICAL DATA AND CHARACTERISTICS 


processor internal imstruc ciicw 
| cycle time Crock throughput 
(CT 4-200. 50° nsec eee MZ ee MIPS 
(i 414-15)2 ee. 67 NSCC oy eames Wi) een ee bree 
(Tea 14=32) eee 80 nsec .... 12.5 mhz .... 6.0 MIPS 
external clocn Cycles 5 iaZ 
time slice: 20te-. . oe 1 msec (approximately) 
internal bus speed .....-.. 80 mega bytes/second 


internal (on chip) memory. 2 kilo bytes 
internal memory cycle .... 50 nsec (for 50 nsec cpu) 


external memory interface. 25 mega bytes/second bus 


external memory cycle ....150 nsec 

address Capabiivey 4 giga bytes(32 bit address) 
Pinks (iserial je). eee 4 (full duplex, DMA) 

Link bit Tratgel..3.32) 4 ee 10 mbits/sec (20 mbits/sec) 
Jank net bit rate (Obs. 1) 3.8 mbits/scecug@e.! mbits 7 
power dissipation ........ less than 500 milliwatts 
physical dimensions 2a 45 mm square chip (84 pins) 


Obs. 1: These values refer to the immediately above 
mentioned bit rates, respectively, and are 
fully explained on chapter III. 





When reading transputer related material, one may find references to T-424. This 
was a prototype that 1s not on the market anymore. 

The systems architecture is simplified bv using the transputer links for point to 
point communications which allows the available transputers to be configured in any 
desired topology matching the programmer needs [Ref. 4,8]. Point to point 
communication links have many advantages over multiprocessor buses [Ref. 7]: 


* There is no contention for the communication mechanism, regardless of the 
number of transputers in the system (that does not happen in shared memory 
systems) (Ref. 9]. 


ele. 2 
Biers SOheey Clie TIVmGLOCK EXAMPLES 


SEC pao ee cee, ig “o's. i. eee ie OuS. mz ) 
Seco ae: a , , Stow mhz } 
SCS SMe ce eM eS os a oo a 16 mhz) 

ec wearer ee 50) nsee (eaemlrz ) 

BON ON ne ob ee als oe oe 750 nsec 

SN/UER 43 sah Ye (eg eames 150 nsec 

VAX 11/780 mi ee eases Z2O00 nsec 

1S: asides Wer s.8 2/7 nsec 

Leries Os (k) ee alge ts z6 nsec 


e There is no capacitive load penalty as transputers are added to a system. 
(specially if thev will work independently) 


However as the number of transputers increase in the system, a message routing 
system is needed in order to permit indirectly interconnected transputers to 
communicate to each other. This will create some overhead for the system and 
Cordeiro [Ref. 6], addresses this point presenting a design and implementation for such 
a system. 

It is up to the programmer to decide which process should be placed in which 
processor. For efficiency purposes, it 1s recommended to place frequently 
communicating processes in adjacent transputers ( directly connected by a link ). 

It is still possible however, to adapt previously designed systems, to this new 
architecture and develop a systems architecture where a central data base would be 
managed by a central transputer, which would address a large memory that could be 
read or Written by the processors connected to the four links, as depicted in Figure 1.2. 
But this would involve further study and it is not in the scope of this thesis. 

Another point worthy to mention is that although this work was developed using 
OCCAM, there is already available a C? compiler, and coming soon a Pascal and a 
Fortran compiler. The Ada compiler for this machine 1s under development and 
according to INMOS representatives, it will be released before the end of 1987. 

This way the final goal of the AEGIS project, that is to research alternative ways 
of implementing the AEGIS system will have the DoD language available on the 


present machine. 


>The C programming language compiler generates code for the transputer 
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Figure 1.1 Block Diagram of Transputer Architecture. 


It is also pertinent to mention at this point that in the last Occam User Group 
meeting, that took place in Santa Clara, CA, in March 10th, 1987 there were 
representatives of IBM, Tektronix and other major corporations showing to the 


participants, work in development and developed by them, using the Transputer. 


C. THE OCCAM PROGRAMMING LANGUAGE 

Occam 1s a programming language that since its first release in 1983 is Known as 
very suitable for description of multiple processor systems [Ref. 10], due to the 
simplicity with concurrency and parallelism can be addressed [Ref 11]. 

In fact, since then, the language has been modified and enhanced in its 


capabilities, and one of the latest versions, known as Occam 2 is described in the book 
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Figure 1.2 System Using a Transputer as Memory. 


by Pountain [Ref. 12]. However, this thesis was developed using one of the primitive 


+ that is best described in the Occam 


versions of the language called Proto-Occam 
Programming Manual [Ref. 13: section 3], with slight modifications introduced by the 
OPS’ TDS compilers implementations described in detail in the Occam Implementation 
section in the OPS Manual [Ref. 13: section 4]. 

The goal of this section is to address briefly the primitive processes and 
constructs used in Occam (Proto-Occam), calling attention to the limitations and 
capabilities this version of the language has. Appendix D presents some hints for 
programming the transputer T-414 using Proto-Occam. 

1. Primitives 

a. The Channel 
The channel (CHAN) is an identifier used for performing communications 
between concurrent processes (if in the same processor) or processes executed in 
parallel (if in different processors). We can think of the channels as a pipe that 


connects horizontally two processes that are being executed concurrently or in parallel. 


*Proto-Occam is so called in the Occam Programming Manual, but sometimes it 
is also referenced as being Occam 1. 
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If the processes are in the same processor (same transputer), this is done through a 
specified memory location determined at compile time, as if it were a global variable; 
but if the communicating processes are in different transputers, the channel uses the 
physical links connecting the transputers. Any type of variable may flow through the 
channel, but the programmer must ensure that the type being transmitted is the same 
that is being expected at the receiver, or the compiler will flag an error. 

This is the basic for the primitives input and output: 


¢ chanin ? char = [his can be read as the variamle enar will receive a valucetnat 
is coming from elsewhere through the channel chanin. 


¢ chanout ! 5 - This can be read as the constant “5” is being output to another 
process through the channel chanout. 


This implies that somewhere in our transputer network there will be a 
process that is transmitting some value through the channel “chanin” and another (or 
may be the same) process is receiving into some variable the value “5” through a 
channel called chanout. 

2. Constructs 


Occam has six basic comstiuects: 


a the sequential (SEQ) construct 

b the parallel (PAR) construct 

c the alternative (ALT) construct 

d the conditional (IF) construct 
the repetitive (WHILE) construct 

f the replicators (FOR) construct. 


The sequential, conditional and repetitive constructs have the same usage as in 
many other structured languages.> It is interesting to note the necessity of having a 
SEQ construct, because normally in such languages this is the only way to execute a 
program. 
a. The PAR Construct 
A parallel construct causes its component processes to be executed in 
parallel, if the component processes reside on different transputers, or concurrently in a 
time shared fashion, if they reside on the same processor [Ref. 13: section 3, item 3.4.2]. 
Note from Figure ie tian 
* Process one and process two are different processes. 


¢ Occam is fixed format and indentations are always 2 spaces for nesting. ~ 


Like Pascal, Ada or C programming languages. 
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| 
CoAtiMeonmns mec 1, c2-: --- channel declarations 


PAR 
WHILE TRUE aa sOCeSS Olle 
gaq * 
| el as 
eens ' x === 6nd process one 
Wale IRUE === process two 
VAR y : 
SEO 
comms 2 ¥ 
Cae ¥ “~-- end process two 





Figure 1.3 Example of a PAR Construct. 


el nereeare mo bests or eilds to delimit processes. 


e We can declare variables anywhere in the code as long as it is before the 
beginning of the process that will refer to it. 


e Three dashes (---) are the indication for comments following them. 

(1) The PRI PAR Construct. The priority parallel construct, a variation 
of the PAR construct, permits at most two processes under it. The first one will be 
given priority 0 (high), and the second one will be given priority | (low). This maps 
exactly to the two priority levels that the chip supports. As the Reference Manual 
[Ref. 7: p. 3], savs, the priority process is expected to be executing for a short period of 
time because when it begins, it can not be preempted. 

b. The ALT Construct 
An alternative construct is used to accept the first message available from a 


number of input channels [Ref: 13: section3,item 3.4.3]. See Figure 1.4. 


CHAN cl, c2 
WHILE TRUE 
VAR xX: 

ALT 


Figure 1.4 Example of an ALT Construct. 
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Note from Figure 1.4 that: 


® We could have any number of channels under the ALT and all of them 
outputing to c3. This 1s a construct that provides mutual exclusion © in two lines 
of code. 


e All variable declarations are separated by commas and terminated by a colon. 
There is also a variation of the ALT construct named PRI ALT, that 
enables the first option of the ALT be executed in precedence to the others. 
c. Replicators 
A replicator may be used with a construct SEQ, PAR, ALT or IF to 
replicate the process a number of times [Ref 13: section 3, item 3.4.6]: 
° SEQ - When used with SEQ it provides a conventional loop. 


e PAR - When used with a PAR it makes an array of concurrent processes See 
fet uiegias 


e ALT - When used with ALT it enables to receive one unique input at a time 
from an array of channels. See Figure 1.6. 


CHAN Wel =a ome 
PAR i = [O FOR nl] 
WHILE TRUE 
VAR X : 


SI 
Sl ee oe 
i 1) fee 





Figure 1.5 Replicated PAR. 


3. Good Features of Proto-Occam 
Proto-Occam has some nice features like: 
e the facility in handling time for performance evaluation (TIME ? var) 
* the use of time delay (TIME ? AFTER sometime) for real-time applications 
e the SKIP that has numerous applications and help to handle exceptions 
® we can access the byte in memory 


e there is no need to declare count variables used in replicators 


©Mutual exclusion is one of the critical issues in Operating System design [Ref. 3] 
and it is neatly handled by the ALT construct. 


Ze 


CeaNiee| nn) a: 
WHILE TRUE 
VAR y 
ALT 1.= [ 1 FOR n ] 
ee y 
at ox 


Figure 1.6 Replicated ALT. 


we can have procedures with formal parameters being arrays of variable size; 
this way the actual parameters may be of different sizes in different procedure 
calls. 


4. Proto-Occam Limitations 


Many of the limitations of Proto-Occam have been fixed by Occam 2, but 


they are still note worthy: 


D. 


there are only one dimensional arrays 


there are no types; the programmer has to establish a convention to use its 
variable names and make sure to address them coherently. 


no floating point 1s available 
no recursion 1s permitted 


no pointers are available 


THE ENVIRONMENT AT THE NPS 


1. Software Facilities 


The Naval Postgraduate School has several Software tools available in its 


computer labs: 


Occam Programming System (OPS), available in the VAX/VMS. It permits 
editing, compiling, linking and running on the VAX, concurrent programs 
Written in Occam, simulating a network of transputers. It will be briefly 
described in Appendix B, but the reader may refer to [Ref. 13]. 


Transputer Development System (TDS D600), available for the VAX/VMS, it 
edits, compiles and down loads the code into the transputer network. It will be 


briefly described in Appendix C, but additional information may be obtained in 
[Ref. 14]. 


Transputer Development System (TDS D701), available for PC-AT type micro- 
computers. It edits, compiles, links, and down loads to the transputer network 
the code to be executed (that was generated on the PC). It is single user and 
requires installation of the BO04’ board in the PC. It uses the Occam 2 


"Described in Table 3. 
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programming language. This system arrived at the lab at a point in time that 
this thesis was already partially written and so it will not be addressed. For 
niore information refer to [Ref. 15]. 


2. Hardware Facilities 
a. Transputer Boards 
The transputer lab has a Transputer Evaluation Module with seventeen 
(17) transputers in the following configuration: 


e one board with one transputer (T414-12) called BOO] [Ref. 16], that is the 
interface with the VAX/VMS. 


¢ four boards with four transputers (1414-15) each called BOO3 [Ref. 17], that can 
be used either with the VAX or with the PC. 


It also has one board with one transputer (T414-15) called BOO4 [Ref. 18], 
that is the interface with the PC, and is located in one of the slots of the Zenith Z-248. 
This makes a total of 18 transputers to work with. 


Table julists itsicharnactensucs. 


* Each number stands for a Link. 





Figure 1.7 The Four Transputers in the BOO3 Board - Fixed Links. 


These transputers can be interconnected and configured in any way 
designed by the programmer using the INMOS links as long as the hard wired board 
connections between transputers (that already exits and are fixed in all BOO3 boards in 
the LAB) are respected’| Rei 17 Pa Secu tainre sl 
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AES 
CHARACTERISTICS OF BOARDS BO01I, B003 AND B004 


Saemiiis f S8oe- 12 mhz transputer. 
10 mbits/sec INMOS link transmission speed 
o4 kbytes of static RAM ee xX IMS 1400-45) 
Zo Key ccamerROM (4 x 27 
.bootstrap loader, 
.memeory test, 
. terminal to_host transparent mode software 
ene ee Serial Inpuc/OUteuUt Connectors for : 
~VAX connection 
meri nalweomiect lon : 
- 64 way DIN connector for external link connections 


| 
| 
| 
a. BOO1 Board 


5o) Gentaining = 


4 IMS T 414-15 mhz transputers — 

1Q0 or 20 mbits/sec INMOS link transmission speed 
Z26 kbytes dynamic RAM per Peaneraes 

96 way DIN connector for external link connections 


b. BOO3 Board 
: 
| 


c. BOO4 IBM-PC Add-in-Board 


one T 414-15 mhz transputer rh 

10 mbits/sec INMOS link transmission speed 
Z2 mbytes dynamic RAM with parity 

62 pin I/O channel connector 





The BOO! board is the interface between the VAX and the transputer 
network. The interconnection is done through standard RS 232. 

The user can develop OCCAM programs on the VAX, debug and test 
using the OPS, and when ready, down load them to be run on the transputers. See 
Appendix D. 

b. Host Computers and Terminals 

(1) VAX. To use any of the systems (TDS or OPS) on the VAX, the user 
must log in from any VT 100 or VT 220 terminal ( this last one has to be in VT 100 
mode, and VT 100 id ). Appendix A presents a detailed sequence for this. 

To be able to down load the executable code, the terminal must be also 
connected to it. There are two ways of doing it [Ref. 16,] and Figure 1.8 shows how 
this 1 done at the NPS lab. 

The following advantages should be pointed out: 


@ The VAX provides us with the VMS Operating System and all the facilities a 
mini-computer can support, mainly a weekly system backup that we do not 
have to worry about. 


Zo 





e Ifa modem is available, much of the work can be done from home. 


e Mfulti-user utilization as far as using the OPS and editing and compiling in the 
TDS (Very handv for class projects). 


e Occam 2 will be available soon in the VAX at the NPS, as an upgrade of the 
OPS and TDS systems. 


The only disadvantage is that when the VAX is down for backup, 
upgrades or repairs, there is nothing the user can do about it. 

(2) Zenith Z-248. The TDS system for the PC is completely independent 
from the VAX. It has a new version of OCCAM more powerful and flexibie. It is 
installed in a Zenith Z-248 micro-computer (PC-AT compatible), with 2.5 mbytes of 
RAM and 8 MHz clock. 

There are two advantages in having a PC 

first the user has the whole micro for him and no problems, except a TDS system 
failure, Would delay any project. An assumption 1s made that to replace a PC is an easy 
task. Secondly, the Occam version running on the PC is temporarily® newer than the 
one on the VAX, and new horizons are opened for research. 

As mentioned before, this thesis was developed on the TDS and OPS 
installed on the VAX and it will not have any other information on the PC based 
svstem. 

c. Printing Facilities 
There are two ways to print OCCAM programs developed on the VAX: 


e Using the VAX / VMS online printer (only files with extensions “.Ist” and ”.his” 
are printable). 


e Using the printer at the lab and the print screen facility provided by the VT 220 
terminals. Anything that is on the screen can be printed this way, and this 
turned out to be one of the best debugging and analyzing tools for the research. 


E. SPRUCTURE OF THE THESIS 

This thesis is presented in 5 Chapters and 7 Appendixes. 

Chapter I was the introduction to Occam, the transputer and the NPS 
environment. Chapter II describes the terminology, the INMOS Links, the methods 
used for performance evaluation, and state the expected results and research questions. 

Chapter III and Chapter VI address each one of the research questions, 
describing the experiments done and presenting the results obtained and conclusions 


reached thereto. Chapter V summarizes the conclusions and suggests future research. 


SThe Occam 2 version for the VAX/VMS will be available at any moment. 
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As mentioned alreadv, Appendix A presents a Learning Sequence for how to 
work with the transputers and Occam, having the VAX/VMS System as a host. 
Appendix B and Appendix C, are tutorials about the software tools available presently 
for the VAX , the OPS and the TDS systems. 

Appendix D presents some hints in how to program in Occam, and call attention 
for some mistakes that most likely one will make when using this new language on a 
new system, with a different and powerful fold editor. 

Appendix E lists the Link Evaluation Program used, and Appendix F lists the 
Test Linearity Program, both with all procedures and library routines that were used. 
When reading the listing files take into account that : 


¢ Occam is a fixed indentation language with two spaces between each nested 
[evel 


e Two dashes (--) marks the begining of new folder with the title aside. 


e Three dashes (---) means that comments follow on that line only. 
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A. 


Il. COMMUNICATION AND PERFORMANCE ISSUES 


COMMUNICATION ISSUES 


The purpose of this section is to set the stage and define a series of 


communication terms that will be used in the following discussion about the transputer 


physical links performance. 


1. Definitions: 


frame - it 1s a packet of bits containing 8 bits plus the frame protocol bits ( e.g. 
Start bit, stop bit, and parity bit). 


bit rate - it is the number of bits that can be transmitted in a unit of time ( e.g. 
Kbits, sec or mbits; sec). 


baud rate - is the number of signal elements transmitted per second. If there are 
only two signal elements (0 and 1) then the baud rate is equal to the bit rate.As 
this is the case on the transputer we will mostly refer to bit rate. 


data rate - It is the number of data elements (bvtes) transmitted per unit of 
time. Normally it is expressed in Bytes per Second. It is always smaller than the 
Pimrate civided by 8, ducito the control bitsmeeded in each frame. 


net bit rate - (or transfer rate) will be defined by the author as 8 times the data 
rate. [his was used to make comparisons to the values advertised. 


2. Data Transmission Basics 


a. Modes of Operation 


parallel transfer mode: when multiple wires are used between the two 
equipments , each one of them carrying one bit of the frame. 


bit serial transmission: when only one wire is used to send the frame, one bit 
eiter the Other: 


b. Communication Modes 
simplex : when data is being transmitted in one direction only. 


half duplex: when data is being transmitted in both directions but alternately 
(switching between transmit and receive mode is necessary). 


duplex - (or full duplex) when data is being exchanged in both directions 
simultaneously. 


c. Iransmission Modes 


Asynchronous Transmission - when the receiver and transmitter clocks are 
independent. Each frame received reinitializes the clock, as the start bit is 
received. It is used when the rate at which characters are generated is 
indetermined and hence the transmission line can be idle for long periods in 
between each transmitted character. 
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Svnchronous Transmission - When receiver and transmitter clock are dependent 
and information is packed in long streams of characters instead of byte by byte. 
Use special synchronizing bytes before each block. 


Most of the information contained in this section was taken from [Ref. 19], 


and it is just included here to make the reading smoother. 
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THE TRANSPUTER LINKS 


“The transputer architecture simplifies system design by using point to point 
communication links. Every member of the transputer family has one or more 
standard links, each of which can be connected to a link of some other 
component. This allows transputer networks of arbitrary size and topology to be 
COM eucte ds: 


This quotation extracted from [Ref. 7: p.6], gives us a macro sense of what the 


link is and how it can be beneficial for the programmer. Following the terms described 


in the previous section, we can say that the transputer links are serial, full duplex, 


asynchronous communication devices that have a bit rate of 10 mbits/sec or 20 


mbits/sec (when available). They provide synchronization between communicating 


processes on a transputer network. 


To provide the reader with a better understanding, the following includes some 


details about the links, extracted from [Ref. 7: p.7]: 


Each physical link provides two Occam channels, one in each direction(input 
and output). The T-414 has four(4) links, so we have 8 physical channels for 
programming purposes in each transputer. 


Communication via any link may occur concurrently with communication on all 
other links and with program execution. 


Synchronization of processes at each end of a link is automatic and requires no 
explicit programming. This is one of the important features one can use with 
the transputer. The links are the concurrency tools and are very easy to 
program by using the Occam channels. 


The information is transmitted on the link in the format depicted by Figure 2.1 , 
where the two beginning “1” are start bits and the ending “0” 1s the stop bit. 


After transmitting a data frame (one byte), the sending transputer waits for an 
acknowledge (ACK) from the receiving transputer, signifying that the byte was 
received and it the link is ready to receive another byte. If the ACK is not 
received the communications on that link will stop. 


[t is still worth mentioning one of the questions we had about how they work: 


“How could a process waiting for communication waste no cpu cycles? 
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a. The DATA frame 


| 0 b. The Acknowledge frame 





Figure 2.1 The Data and Acknowledge Frames. 


The information we have got verbally from Mr. Neil Mitchell from INMOS 
office in Santa Clara was that the links have a 1 byte buffer inside it. When a process 
has to transmit, the first byte of the message is, in fact transmitted and it is received by 
the link on the receiving transputer, and stored in this buffer. Two situations may 
occurthen: 


1 If that receiving link is already waiting for an input, the acknowledgement 1s 
sent right away to the transmitter by the other channel, and this is all the 
transmitter needs to follow on with the message. 


2 If that receiving link is not waiting for any input yet, the acknowledgement 1s 
not sent. What happens in the sending transputer is that, as the ACK does not 
arrive, the process is placed on the wait queue, and a pointer to that process is 
placed in the respective channel memory location (each channel has its own) 
until the ACK arrives. When this happens, the process is awakened and the 
message is then transmitted. 


C. EXPECTED RESULTS 
Assuming we have a bit rate of 10 mbits/sec and the frames have no delay 
between them, two situations should be considered: 


¢ One Channel Transmitting - when only one of the channels is being used for 
transmission (or reception) of messages at a time, and so the other channel is 
free to bring the ACK frames. 


e Both Channels Transmitting/Receiving - when we have message passing in both 
channels at the same time and so the ACK for a received frame is piggy-backed 
(appended to the end of the frame) [Ref. 19: p. 129], to the next transmitting 
frame. 


en 


1. One Channel Transmitting 
In this case, there is no ACK sharing time with the frame on the channel and 
we will get the maximum rate possible as follows: 


e Net bit rate = (8/11) * 10 (mbits/sec) — 7.2/7 mbits/see Gre 27 opis sec. 
Where § is the number of information bits and 11 the total number of bits in a 
frame. 


e Data rate = 7.27/8 = 0.91 mbytes/sec or 909 kbytes/sec. 
2. Both Channels Transmitting/ Receiving 
In this case we will have: 


© Net bit rate= (8/13) * 10 (mbits/sec)= 6.15 mbits/sec or 6,154 kbits/sec; where 
13 stands for the 11 frame bits plus 2 ACK bits that are now sharing the link 
also. 


e Datarate = 6.15 /8 = 0.77 mbytes/sec or 769 kbytes/sec. 


The results are summarized for 10 and 20 mbits/sec rates in Table 4. 


TABLE 4 
EXPECTED MAXILVIUM IRANSFER RATES ON Tie SRAN sive 
LINKS 
link bit rate no 20 mbits/sec 
One channel 7 O27 14,545 kbits/sec 
Both channels 6,154 127008 kbits/sec 


The reason for mentioning the values in kbits/sec is due to the non-availability 
of floating point and this way, to get some precision, we needed to use this unit in all 
performance measurements during the evaluation. 

It is worth mentioning that these values were expected for either one single 
channel, or the eight channels operating in parallel because the memory 1s multi-ported 
and permits access to each one of the links and the processor in an interleaved mode. 
[Ref. 7: section 2, p.1]. It was also expected that these rates should not be affected by 
another process using the Central Process Unit (CPU) for calculations and memory 


accesses at the same time, for the same reasons mentioned above. 


D. RESEARCH QUESTIONS 
From the above, some research questions could be devised as follows: 


1 Does a link transmit at 1Ombits/sec and 20 mbits/sec transfer rate? 


SZ 


tm 


fsa Gd 


ay 
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Is the ACK really transmitted as soon as the receiver channel receives the first 
bit of the data packet? 


Is the communication between the transputers really occurring in parallel? 
What is the effect of message lengths on the link transfer rates? 


What 1s the mutual effect on the link transfer rates, of more links operating in 
parallel in the same transputer? 


Can the CPU work in parallel with all the links? 


What is the effect of a communication independent process, running on the 
CPU, over the transfer rates obtained in a link by another process, in this 
transputer? 


What is the effect of the communications, over the process that is being 
ecculted: in the CPU? 


Does the Transputer achieve linear performance improvement? 


What happens when a process 1s time sliced in the middle of a communication 
by physical link? Does the link stay blocked? 


Questions | through 6 will be discussed in Chapter III, questions 6 to 9 in 


@iaptrer 1y> Question 101s still pending anesis left for further research. 


E: 


PERFORMANCE NIEASUREMENT ISSUES 
ms mentioned in the paper by Ceilary [Ref. 20], there are five methods for 


computer network measurements, depending on the approach used for data gathering. 


They are: 


© 


Standard User Method, 

Reference User Method, 

Software Monitoring Method ( Programs ), 

Hardware Monitoring Method ( Probed Equipments), and 
Hybrid Monitoring Method ( A mix of the two above). 


In this thesis both Software and Hardware monitoring methods were used for the 


following reasons: 


The hardware monitors are more reliable than the software monitors. 


For statistics purposes and for large amount of data, some times it 1s impossible 
to obtain, using hardware measurements, the same amount of information that 
can be collected by software programs, in a same period of time. 


This way, we used hardware monitors to confirm preliminary results obtained by 


software and after validating them, a massive collection of data was gathered to permit 


and back up the conclusions reached. 


1. Hardware Methods 
Two approaches were used: 
e by using a Oscilloscope to monitor | or 2 channels of a link at the same time. 


® bv using a Logic Analyzer to monitor 4 and 8 channels (in 4 different links) of 
the same transputer. 


a. Using the Oscilloscope 
The idea of using the Oscilloscope was to identify on the screen a known 
pattern of bits in continuous transmission, and also to obtain an approximation of the 
bit rate. Also by observing subsequent frames, try to estimate the data rate and the 
interval between frames. Another observation that could be made, as seen in the 
following Chapter and also documented by using Polaroid photographs, is the relative 
position of the Acknowledge (ACK) frame, in reference to the transmitted frame, in the 
second oscilloscope channel. 
The equipment used was the Tektronix 364 Storage Oscilloscope and the 
camera was the Hewlett Packard HP-24A. 
b. Using the Logic State Analyzer 
The idea of using the Logic Analyzer was to monitor several channels of a 
same transputer and really see if there were bits been transmited at the same time, in 
some or all of the channels. Our Logic Analyzer has the capability to monitor 32 
channels and store 250 subsequent bits in each in each channel after triggered. 
As all channels are asynchronous, an external clock was necessary and so a 
Pulse Generator was used to provide this clock. To help in getting a more precise clock 
a Digital Counter was also used to sample it. The equipments used were: 
e Logic State Analyzer Mod. 532 with Analyzer Probe Model SIA. 
® 20 mhz Function/Pulse Generator Wavetek Model 145. 
® Measuring System Hewlett Packard model HP-5300A. 
One problem arose from this: 


¢ The maximum external clock frequency acceptable by our logic analyzer was 12 
mhz and as recommended by Nyquist relation, we should have a sampling 
frequency at least the double of the sampled signal (Normally 16 times is used) 
[Ret io spate: 


In our case, the sampled signal was supposedly at 10 mhz and so a 
minimum clock of 20 mhz should be used. As the Logic Analyzer did not permut that, 
we used a 10mhz pulse instead as clock, and, by trial and error varying the clock 


frequency and pulse width, after numerous tentatives we obtained some representative 
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results that are presented in the following chapter. It is good to mention that we did 
not even trv to monitor tne links running at 20 Mhz for the same reason. 

c. Test Points 

To monitor the links activity, a homemade monitoring bridge that was able 
to connect up to eignt channels was used and, with it, we had the ability to monitor 
the four links of a transputer. 

2. Software Methods 

With this respect, several programs were made at first to compare the rates 
obtained in hardware with the ones in software, and for the final report on the links 
performance, a complete Link Evaluation Program was designed, to handle all possible 
cases of constructs to communicate, several kinds of channel parallelism and two 
different cases of CPU load, concurrently with the communications. The output of this 
program was a table of values that was used to generate some graphics using the 
EASYPLOT system at the IBM 3033. Appendix E presents a listing of the evaluation 
program with the Occam library used. The terminal driver is the one provided by 
ieee oe wth the Kévboard and Screen references made using the first letter in 
uppercase, and therefore is not included. 

The library.occ is a collection of previous existing procedures, some generated 
by the manufacturers and some made originally for the OPS System by previous 
workers, updated to be used on the TDS, plus additional procedures for 1/0 and utilities 
written by Cordeiro and myself. They can be browsed on Appendix E, inside the 
program listing. 

To observe the effect of multiple transputer execution of the same program, a 
series of versions of Program TEST LINEARITY were made and the 17 transputer 
version 1s listed in Appendix F. 

All programs above used basically the same three tools: 


e - The TIME channel provided by the compiler and Occam to read the internal 
transputer clock in ticks. Table 5 summarize them. 


e - the tick.to.time procedure used to convert time from ticks into hours, 
minutes,seconds and milliseconds. It receives as input parameters the “startime” 
(in ticks), the “endtime” (in ticks) and the transputer type, and outputs to the 
screen the elapsed time in hours, minutes, seconds and nulliseconds, for the 
specified transputer. This routine is listed in Appendix F. 


e - the transfer.rate procedure similar to the previous one but which computes 
the transfer rate measured in the channel observed. It receives as parameters 
“startime’, “endtime”, “transputer type nr.”, and the “size of the message” 


TABEE.S 
THE DIFFERENT Gh ers 


T-414 12mhzZ ---e------- 
T-414 1iSmhz (high) --- 
T-414 i5mhz (low) --- 
VAX/VMS -----+--------- 


transmitted and outputs the transfer rate 


routine is listed in Appendix E. 


> te hee 
> 1 aa 
> ene Ge 
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1.6 micro-seconds 


micro-second 


64 micro-seconds 


100 nano-seconds 


through the variable “rate”. This 


Ii. THE EVALUATION STARTS 


A. INTRODUCTION 

In this chapter we start to address the research questions related to the 
evaluation, as listed in Section D of Chapter II. 

Section B describes how we verified that the bit rate is indeed, 10 mbits/sec or 20 
mbits'sec. It also shows the maximum values achieved for the net bit rate (transfer 
rate), for the various construct types. 

Section C shows the configuration used and demonstrates that the transfers in 
different links occur in parallel. eventually in all 8 channels of the 4 links. 

Section D describes the message size, and the channel parallelism effects on the 
transfer rates for the various constructs. 

1. The Available Constructs 

Occam permits us to use several different primitives and procedures for 
communications between processes. The first to be mentioned are the input and output, 
already explained in Chapter I. We used them in two modes: 

¢ transmitting bytes (characters), or 


® transmitting words (integers). 


BYTE.SLICE.OUTPUT (chanid, buffername, initbyte, blocksize) 

where: 

- chanid - the channel name where the communication will occur 
- buffername - the name of the array of variables 

- initbyte - the array index of the first byte to be transferred 


- blocksize - the number of bytes to be transferred 


ican sl crm leMeieeow ie | Procedure Call. 


Miemchidemodeisstnceb wilt sSelIGE MNPUY and BYTE SLICE OUTPUT 
procedure. These procedures are microcoded subroutines that provide a block transfer 


of bytes. Figure 3.1 shows the procedure call and an explanation of the parameters 


(Ref. 14: section 4]. These procedures cannot be used when doing programs for the 
OPS. The advantage they bring us, is a better performance, but when using OPS we 
are not concerned about it. 

The last mode 1s the WORD SEICE INPUI and WORD SLIiIG= opie 
procedure, also microcoded, that provide block transfer of words. As just mentioned 
above, the procedures showed to be much faster than the input/output primitives, but 
with similar performance to the BYTE SLICE procedures. 

2. Considerations About Memory Management 

As we have a machine with internal and external memory with different 
performances and address capabilities, this was a major concern, as far as performance 
could, and in fact is, undoubtedly affected. The documentation is not clear enough to 
pernut us to assure how this is handled by the processor, in the b001 and b003 boards. 
We tried to check the addresses mentioned in |Rel 7 sectionsgape. oe ee weovene 
not able to verify that. 

What can be said, though, is that it looks like the memory (internal plus 
external) on the BOO! transputer board is divided into four memory banks, each one of 
them beginning at addresses 0, 16k, 32k, and 48k , and the data and programs are 
mapped evenly over these four banks. We reached this conclusion after browsing 
several listings of the memory contents obtained from the transputer in the BOO] 
board. through a “dump” routine designed and implemented by M. Cordeiro, also part 
of the LIBRARY.OCC, included in Appendix F. 

In our evaluation program outputs, we tried to observe any noticeable effects 
that could be explained by a fast or slowest memory access, but the evidences were not 
strong enough, as it will be mentioned further on. As a curiosity, we measured the time 
to initialize four arrays of 15,000 bytes each in the BOO] memory and we have got 133 
msec! We assumed that programs smaller than 2k bytes long, will be loaded entirely 
into internal memory, but we could not prove it and this is left and strongly 


recommended for further research. 


BB. A CLOSE LOOK ON THE BIT RATE 
The evaluation started trying to answer research question | that 1s transcribed 
mene. 


e¢ “Do the links transmit (and receive) data at 10 and 20 mbits/sec transfer rates?” 
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1. First Software Results 

To find that out, simple programs were made to transmit and receive long 
messages (arravs) through the physical links. The transfer rate was obtained by dividing 
the number of bits transmitted by the time spent on the transmission. A flag was used 
(single byte) from receiver to transmitter to assure the transmitter would only transmit 
when the receiver was ready. This way, we would be timing the best possible case with 
the best possible accuracy. The basic program code used for the transmitter and 
receiver 1s in Figure 3.2 The BYTE SLICE was the construct used. because from the 
very first tests it proved to be the fastest, even for one byte being transmitted. 

The configuration used for that was as simple as it could be. Two transputers 
connected by a link hosting one procedure transmitter (TR.1) and one procedure 
necerer ( PR.2). Figsure 3.3 depicts that. 


SEQ 
Guiana ¢ flag” --- flag is received 
TIME ? startime --- time is stored in var startime 


--- transfer begins 

EoueesiiCe.OUleUl (echanZ, buffername, 1, block.size) 

--- transfer ends 

TIME ? endtime --- time is stored in var endtime 

--- call to Pyecedunemeranstem Fate outputs the rate. 

transfer.rate (startime, endtime, transputer.type, 
blocksize, rate) 


a) Transmitter 


SEQ 
Soo eeecidt stag ts Sent to transmitter  . 
BYTE.SLICE.INPUT (chan2, buffername, 1, block.size) 


b) Receiver 


Figure 3.2 Basic Code for Transmitter and Receiver. 


The block size used was 15,000 bytes, in order to avoid possible dragging 
effects of small messages. The results obtained were around 3,800 Kbits/sec with an 
execution time of 31.5 msec, average. As we can notice from table 4, in Chapter ITI, it 
was almost half of the expected value of 7,273 Kbits/sec. Why? The monitoring of the 


channels with the oscilloscope answered this question. 
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Figure 3.3 Configuration for Initial Tests. 


a. Links at 20 mbits/sec 
With the links switched to 20 mbits/sec, we could only have 
communications between transputers located on BO003 boards, so, although the code 
was practically the same, the configuration had to be slightly different. Figure 3.4 


shows us how it was. 


(BOO1) (BOO3) 
a eeeare tosh Paar Cc a ee 





Figure 3.4 Configuration for Measuring Links at 20 mbits/sec Bit Rate. 


The results obtained for block sizes of 15,000 bytes using also the BYTE 

SLICE construct, where of the order of 6,000 to 6,100 kbits/sec, again very small, if we 
compare them with the expected of 14,545 kbits/sec. 

2. Using the Oscilloscope 

Another simple program that made a continuous transfer on the link, made it 

possible to observe the frame transmitted and estimate the rate on the oscilloscope 

screen. The message transmitted, using BYTE SLICE, was a sequence of TRUES. The 

TRUE, in Occam, is a sequence of 8 binary 1’s and so the frame was as Figure 3.5 


shows. 
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stdigies ee information bits( TRUE) IStop, Dit 





Figure 3.5 Frame Transmitted for Oscilloscope Observations. 


The basic code used is depicted in Figure 3.6. There is no time sampling or 
flags to avoid any side effect on the oscilloscope screen. Figure 3.7 shows the picture of 
a frame like the one on figure 3.5 followed by an acknowledge (both appear on the 
Same trace due to vertical mode ADD used on the oscilloscope. All the oscilloscope 


settings are also mentioned below the picture. 


WHILE TRUE | 
Eee oeler.OUIPUL (chang, buftferige 1, block.size) 


a) code on the transmiter 
KRRKKARKKAKKKKKRKKKKKKKKKKKKKKEKKKKKKKKKKKKKKKKKKERKKKKKKKAKKK 


WHILE TRUE | 
Ewe voulce. INPUb@Mcnanz, busser2, 1, bitock.size) 


b) code on the receiver 


Figure 3.6 Example Code for Oscilloscope Observations. 


Note from Figure 3.7 that the 10 “ones” of the frame occupy 5 divisions. This 
sums up to | microsecond. So we have one bit per 0.1 microsecond and this implies a 


bit rate of 10 mbits/sec (gross). 


Conclusion 1} 


The bit rate is in fact 10 mbits/sec, 


if we consider only one frame. 





4) 





7 ee S 


Oscilloscope Ssecemics. 


| channel 1 --> shows the transmitted frame | 
channel 2 ~-> shows the acknowledge 
time scale ==-97 0, 2 MBPerTOSeC 7 Guster 


Voltage scale =="Z volts 7 divytsicee 
vertical mode -> ADD 
trigger source-> channel Il 


a 
Figure 3.7 Picture of Ome" Framerate (Umbits ce are: 


The Acknowledge appears enlarged due to the lack of synchronization 
between both channels and the trigger source to be oscilloscope channel |. We can't 
take precise measurements, but we can estimate the best and worst cases: 


¢ best case - The ACK pulse is beginning at the trailing edge (leftmost) of the 
ACK frame. This will give us a distance of approximately 200 nsec between the 
last bit of the data frame and the acknowledge frame (remember that there ts a 
“FetO Bit alter the lasts omen 


* worst case - The ACK ends at the leading edge (mghtmost) of the ACK pulse. 
This will give us a distance of approximately 300 nsec instead. 


Conclusion 2 


The ACK frame leaves the receiver 200 to 300 nsec 


after the transmitted frame arrived! 
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"8S Chansel Sefer 


S: | 
channe --> eee the transmitted frame | 
channel --> shows the acknowledge 
te me ae @ ==> ©.> microsec / division 

voltage scale -> 2 volts / division 
vertical mode -> ADD 
eri coer seurce—-- Channel 1 


eee eee 


Pigure- oo 1 atee Mata Proames at 1Ombits sec Rate. 


Increasing the time scale of the oscilloscope to 0.5 microseconds, we could 
observe more frames and acknowledges as shown in Figure 3.8, and from this picture, 
using the same best and worst case approach, we could estimate that the distance 
between the ACK and the following frame (center) is between 500 and 600 nsec. We 
could also notice that the distance between consecutive data frames is between 900 and 
1000 nanoseconds. 

So, estimating the transfer rate from the picture, assuming all frames will keep 


at least this space between them, we got: 


e best case- for each 1100 nsec information we have 900 of line inactive. 
this ratio by the expected transfer rate of 7,273 kbits/sec, we get 


(1100 / 2000) x 7273 = 4000 Kbits/sec 
® worst case - then 1000 nsec of line inactive would bring us 
(1100 ; 2100) x 7273 = 3809 kbits/sec 


If we 


ie ye 
multiph 
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As we can see the hardware results were confirming the previously obtained 
SONA Cue iess 

Another fact to add is that. during our observations, the frames were not 
aways equally spaced as shown in Figure 3.8 In fact, this figure shows the most 
equally spaced results we ever obtained. Figure 3.9, in which the time scale was once 
more increased, to | microsee per divistemmeeemealmmote tater fifth frame mune 
channel at the bottom 1s more spaced than the four previous ones. In this picture the 


ALT vertical mode was used to)perm Usionceastnce )@ieonmcmemmpper trace. \Ote Tic 


‘egularity which the acknowledge appears 200 to 300 nsec after the received frame. 





Ogc* a esas e ape 
channe --> s ae the transmitted frame 
channel 5 --> shows the acknowledge (upper) 
time scale --> 1.0 Hepes j/ division 
voltage scale => Z volts / division 
vertical mode -> ALT 
| trigger source=-> channel 1( bottom) 





Figure 3.9 Five Frames Observed at 10 mbits/sec Rate. 
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a. Switching the Rate to 20 mbits|/sec 
Simular observations were made for the links operating at 20 mbit/sec rate 
and Figure 3.10 that was taken with time scale 0.1 mucrosec per division shows the 
same 1) “cones” of Figure 3.7 in approximately 0.5 microsecond, that is half of the time 
that was cbtained there. The ACK now is in oscilloscope channel | and is the trigger 


source (this is the reason it is now well defined). 





Veep CeCe 4 Settin 
channe “=> Ss ae the acknowledge pee Tous ) 
channel 5 -=-> shows the transmitted frame( next) 
time scale ==> 0.1 microsec / division 
voltage scale => Z volts / division 
vertical mode -> ALT 
trigger source=> channel 1 (upper) 





Figure 3.10 One Frame and the ACK at 20 mbits/sec Rate. 


Note the time delay between the ACK (upper trace) and the following 
frame (lower trace) that was measured as about 400 nsec. 

Figure 3.11, taken with time scale 0.5 microseconds per division shows us a 
series of “TRUE” frames at 20 mbits/sec rate and the ACKframes in the same trace. 


We could estimate the percent of time the link is actively transmitting as around 40% 
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of the total time approximately. If we take 40% of the predicted rate of 14,545 we get 
5,818 kbits sec. Comparing this with the software obtained value of 6,100 kbits/sec, we 
can conclude that the value is reasonable enough for an explanation of the software 


results. 





OSE eee Peper oe: 

channe --> shows the acknowledge 
channel 2 --> shows the transmitted frame 
time scale -=-> 0.5 microsec / division 
voltage scale => Z volts 7 division 
vertical mode -> ADD 
trigger source-> channel 2 


Lr ere —_ 


Figure 3.11 Four Frames and ACK at 20 mbits/sec Rate. 


3. Comparison Between the Constructs 
In this subsection we would like to include the maximum values of the transfer 
rate obtained, ever for the various constructs. They are summarized on Table 6 and 
were Obtained using different programs, with different message sizes and so they are 
not mentioned. I[t is interesting to note that the input operation has a slight tendency 
to be quicker than the output, which 1s not true. This occurs because of the flag 


positioning, which will slightly affect the rate, but the rate should be considered as the 
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Saiicwe watican be mentioned, however, is that [Ref. 7: section 2, pp.26,27], shows us 
an expected performance summary and there the input primitive is rated as using 26.5 
processor cycles while the output would take 26 cycles, and this is not much of a 
difference. This same reference still mentions that the values are not definitive and 


may suffer changes as more information is collected. 


ASE AG 
MAXIMUM TRANSFER RATES OBTAINED (KBITS/SEC) 


HopuL/ Output BYTE SLICE |WORD SLICE 
een Caves procedure procedure 
bytes words 





Browsing the figures on Table 6, one question comes up at once: 


“ Why is the transmission using the input and output primitives, so much 
slower in comparison to the built-in procedures?” 

For the byte transmission case, using the primitives, if we look at Figure 3.12, 
we will see how an array of “TRUES” is transmitted through the link, at 10 mbits, sec 
selected bit rate. The information seems to be stored one byte per word and this way, 
for each “TRUE” byte, three emptv frames follows. Note that the frames carry only the 
start bits (two “ones”’). The time between frames containing information, measured at 
the lab was 13 microseconds. 

For the word (integer) transmission case, if we browse Figure 3.13, we see a 
similar pattern to Figure 3.12 but with the difference that all frames are effectively 
carrying information bits. The information used to ease the observation was maxint, 
which is, for our 32 bit machine 2,147,483,647 decimal or ” 7FFF ” hexadecimal.’ The 
elapsed time measured at the lab between the acknowledge of the last byte of the first 
word and the first byte of the second word was around 5 microseconds. By doing same 
calculations done for the BYTE SLICE case one will conclude that the maximum 


values obtained are in accordance with the observations on the oscilloscope. 


”The transputer T 414 uses signed integers in the range - 2,147,483,648 to 
2,147,483,647 decimal or 8000 to 7F FF hexadecimal, respectively (Ref. 7: section 2, p. 
2]. 
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Oe eee Se eee 
| channel 1 --> shows transmitted frame (upper) 
| channel 2 --> shows the acknowledge (lower) 

time scale --=> 2.Q microsec / division 

voltage scale -> 2 volts / division | 
| Verticals nodes=—2 ean 
| trigger source-> channel 1 | 

{ 


Figure 3.12 TRUES Transmitted Using the Input/Output Primitives. 


As a conclusion of this section, we could prove that the software 
measurements and the procedures used to calculate the transfer rate were producing 
reasonable values, that agreed with those observed on the oscilloscope. The reason we 
had not obtained the expected transfer rates was because the link is not continuously 
active as the literature led us to believe, and there is a considerable delay between the 
receipt of a frame and the departure of the corresponding ACK. Also, after the ACK 1s 


received by the transmitter, there is another delay to transmut the next frame.!° 


lOrn fact during the Occam User Group meeting alreadv mentioned, in Santa 
Clara, CA, Mr. Martin Booth from INMOS officé at Santa Clara said that the data 
rate we should really expect on the links was 450 kbytes/sec, what agrees with our 
results (450 x 8 = 3800 ). 
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eect ee oe Boe aoe . 
channel 1 -=-> shows transmitted frames (upper) 
channel Z2 --> shows the acknowledge (lower) 
time scale ==> 2.0 microsec / division 
voltage scale -> 2 volts / division 
vertical mode -> ALT 
trigger source-> channel 1 | 


a 
Figure 3.13  Maxint Transmitted Using the Input;Output Primitives. 


Although, it 1s expected, that the new transputer version, the T 800, will solve 
this problem by permitting the acknowledge leave the receptor, as soon as the first bit 
of the frame arrives, and this way the delay would not exist, or at least be smaller 
Pe t.2 1]. 


C. OBSERVING PARALLEL ACTIVITY ON THE LINKS 
1. Using Software 
To observe the links working at the same time, we needed to build a different 
configuration. As we have 4 links per transputer, we needed at least 5 transputers to 
make all jinks work in parallel at the highest possible rate. The configuration used 1s 


depicted in Figure 3.14. 
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CHAN C1J 





Figure 3.14 Configuration to Observe the Four Links Operating in Parallel. 


In processor root we placed a procedure that was able to transmit and/or 
receive in parallel to/from the four transputers 0,1,2 and 3. Figure 3.15 shows the 
Occam code used to do that. 

Using Figure 3.15 as a template, one can implement a similar code to transmit 
in 2 or 3 channels in parallel just by removing the unwanted BYTE SLICE procedure 
calls. Note that the channels mentioned on Figure 3.15 are in accordance with the ones 
on Iigure 3.14 (1,3,5,7 are output channels for transputer root). On the cther handseme 
may think of using the input channels at the same time, but this will be addressed in 
next section. 

The receivers in their turn have a simpler code than the transmitter, because 
cach one of them is only communicating with transputer root. Figure 3.16 shows it. 

If one tries to map the channels of Figure 3.16 to the configuration, one will 


notice that there is no such a channel in or out on Figure 3.14 and to clarify that 





PROC transmitter (CHAN chan0O, chan2, chan4, chan6, 
chanl, chaas, chans, chan? )— 


| -.. declarations 
SEQ . 
. buffers initializations | 
PAR 
| chanO ? flag0O --- flags are received from each | 
chan2 ? flagl --- of the receiving transputers 
chan4 ? flag2 To eae on after all of them 
chan6 ? flag3 “--- are ready the timer is started 
ae Cao cote ne 


Erle. SLLGCE OUTPUT (chanl.. bufter®, block.size 


block.size 
block.size 


Berean scl TCR.OULPUL (chans, i bukterZ. 
Prue, oLlGh.OULPUL (chan?, buffer3. 
TIME ? endtime 
--- transfer rate calculated will be in VAR rate 
transfer.rate (startime, endtime, 1, blocksize, rate): 


My, 
Brlecoll@hwOULPd) (chans ,.woutter:, §. block.size 

ae 

ily, 





Figure 3.15 Code Used to Time Transmission Through the Four Links in Parallel. 


PROC receiver (CHAN in, out)= 
... Geclare variables 

; initialize buffer 

SEQ 
Out ee lag | ; 
BYTEeshI Ce @NPUT©(in bucber, 1) block.size); 


Figures '6, Code for the Receivers. 


Figure 3.17 shows how the configuration would be actually coded for these processes 
to be mapped and work properly. 

As one may notice from Figure 3.17, the chan0O inside the procedure refers to 
the chan{0] on the configuration, and so on. We could think of chan{0] being the actual 
parameter and chanO being the correspondent formal. This is not strictly true, because 
on the configuration we are onlv placing the procedure on the processor, not calling it, 
but the analogy is still valid and the names were chosen to make it easier to 
understand. The users and programmers may use any name for channels, and in fact 


we used some different ones in our implementations. The importance is to get the idea. 
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CHAN chan[8]:--- channe 
PLACED PAR 


oe configuration 
DEF root = 100:--- ols Sao a number to root 


variables for physical channels 


PROCESSOR root 
--- placing channel names sonw eis catechanneats 
PLACE chan]0 | ABelank0 ieee 


PLACE chanli!] AT linkOcute- 
PLACE chan! 2|) AT linkin 

PLACE chan/[3} AT linklout : 
PLACE chan|4] AT lamnkZunee 
PLACE chan[5] AT link2Zout : 
PLACE ehani.6)| AT ohimks tage: 
PLACE chan 74 AT linksout 


--- placimg™ the preceduremeem es. ete on the processor 
transmitter (chan[0], cham Ser : chan lel. 
chan[1], chan[3], chan chan 


PROCESSOR Q 
PLACE chan[0O] AT linkOout : 
PLAGE chan! Fl AP iinkoan 


receiver (chan[1], chan[0]) 


PROCESSOR 1 | 
PLACE Chan | Zia tenkOetee: 
PLACE chan{3] AT linkOin 


receiver (chan[3], chan[2]) 


PROCESSOR 2 
PLACE chan[4] AT linkQOout : 
PLACE chan!) 5) 2Pwtrnke an 


receiver (chan[5], chan[4] ) 


PROCESSOR 3 | 
PLACE cpants| AT linkOout 
PLACE chan[{?7] AT linkOin 


receiver (chan[7], Pie 


Figure 3.17 Configuration Code for the Link Evaluation Program. 


Using the program described above, the results obtained for a block size of 


s00 bites ence 


parallel, but there is nearly linear improvement in communication performance due to 
parallelism, because the overall data transmission jumped from 3670 to 14520 ( 4 x 
3630 )! 


| channel] Ae 3670 kbits/sec 
2 channels im papallcle- 3670 kbits/sec(in each channel) 
3 channels in parallel ..... 3650 Kbits/sec(in each channel) 


4 channels in parallel ..... 3630 Kbits/sec(in each channel) 


These results show a slight decreasing performance as more channels are in 


OZ 


2. Using the Oscilloscope 
As had happened with the initial observations related on the previous section, 
the programs used for this observation where adaptations of the ones just presented 
using the WHILE TRUE construct to permit continuous transmission. and taking off 
all timing and flags, so we will not repeat them here. As we know, the maximum we 
could monitor at one time, was two channels. Two observations were then made: 


¢ two channels of different links transmitting in parallel (Figure 3.18), 


¢ two channels of the same link transmitting in parallel (Figure 3.19). 





Oscilloscope, Seed . 
channe --> shows transmitted frame yepeo 
Channel 2 e--> shows transmitted frame (chan3 
time scale --> 0.5 microsec / division 
voltage scale -> 2 volts / division 
vertical mode -> ALT 
trigger source-> channel 2( lower) 
storage mode used 


EE ee 


Figure 3.18 Two Channels of Different Links Transmitting at the Same Time. 


In the first case, Figure 3.18 shows the same frame used on the previous 


section (TRUE) in two different channels and one may notice how they overlapped. It 
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is Worth emphasizing that the interval between frames is due to the acknowledge delay 
explained in last section (not shown here), and the different phase between the wave 
forms is due only to the communication processes had begun at different absolute 
times. This implies that, as the links have the same speed, and as the procedures are 
the same. this difference of phase is kept constant as long as the processes are running. 
This photograph was taken in storage mode due to the fact that in normal mode the 


unsynchronism between the channels did not permit us to see both waveforms clearly. 










| *8eN channel J See 

| channe --> shows transmitted frame (chanl) 
| channel ~-> shows received frame (chanO 

| time Sais s=> Of anilcrosee 7 seivisi on 

| voltage scale -> 2 volts / division 

| vertical mode -> ALT 

| trigger source-> channel 2( lower) 

storage mode used 


Figure 3.19 Two Channels of the Same Link Operating at the Same Time. 


In the second case, Figure 3.19 shows two channels of a same link operating 
at the same time. This picture was also taken in storage mode for the same reasons just 


mentioned. Note that at this time we can observe the acknowledges piggy-backed on 
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the transmitted frames. The ACK on the upper trace are sent for the frame been shown 
on the lower trace, just a little before in time. The reverse is valid for the lower trace 
ACK. Note that at the time the picture was taken, all eight channels were operating in 
the same way. 

3. Using the Logic Analyzer 

As mentioned in Chapter III, a snapshot of parallel operations is not easy to 
get. Our Model 532 Logic State Analizer could store up to 250 words of 32 bits each 
monitored by 2 Logic Probes of 16 bits each. We used only one of the probes once 
mionitoring 4 channels, and another time 8 channels. In the first case, monitoring 4 
channels, three of them were carrying data frames transmitted by different links ( the 
handy “TRUL’s), and the last one carrying the acknowledge of the fourth link. Figure 
3.20 is a reduction of the printout obtained from a representative part of the 250 
words. Remember that the links are serial communication devices and the probes are 
more effective When monitoring parallel buses, specially if there is a clock available on 
the bus (synchronous buses), and so the sequence of “ones” appear vertically on the 
picture. The program being used was the EVALCONTTRUE.tds, using the same 
configuration depicted on Figure 3.14, and the channels monitored were chanl, chan3, 
chan5 and chan6. The program was transmitting continuously blocks of 15000 bytes of 
trues by the four output channels (1,3,5,7) of transputer root. The first column is the 
memory position of the logic state analyzer. In the second column bits 4 and 8 (from 
left to right) carry respectively chan! and chan3. The third column bits 4 and 8 again 
refer to chan6 and chans5, respectively. 

Figure 3.21 as well shows us one representative section of the samples 
collected by the logic analyzer from eight channels distributed as Table 7 shows. 

The “one” bits that appear in the other columns are probably cross-talk due to 
the probe being made of parallel wires, while the [NMOS links are always in twisted 
pairs. It is also good to reinforce that when the links are transmitting and receiving in 
parallel, the acknowledge appears piggy-backed, as Figure 3.19 shows, and in the 
sequence of bits We can notice them very clearly in several spots. 

Although, by the evidence from both the logic analizer and the oscilloscope, 
we are sure that the channels indeed operate in parallel, it would be more satisfying to 
obtain data which more closely coincides with the measurements taken by software 
experiments. Our suggestion is that monitoring the channels with a logic state analyzer 


which can handle a faster clock, would enable a more exact measurement of 
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Figure 3.20 Output from the 


VOUUUUUY 
ers rcesig 
Vnisiuu 
OOuniaIac 
UII 
Motta cio 
Coldest sepa) 
CercictCiusog 
VWIHUNNUY 
YUUCHOtd 
VOUIiNIO I 
Wiekeielelarem| 
VYVuumnIounu) 
oH900UoGGd 
vuuuNOUO) 
Owaougud 1 
QUNW0001 
voOuttdud I 
OuvuV00U 1 
GOoDUvOLIO) 
ONvONVONN 
OuvaGuAUuKd 


QOu1IWU00N 
GoaKvgcuad 
Oiwyi0I0 
CumiagnmoG 
YuuIunNOU 
Cuccmugugd 
OQ NUUUN00 
CoawuOGOGs 
‘OTPINIPIPI SL OLe) 
Caucadug t 
YNUUNN00 1 
vuUuUUY J 
YVUUUUUU 1 
Oeugacuuida 4 
OvvHNOY 1 
wVoOuUdUgG I 
YumMIvvdo01 
ougtniou | 
QuuHvvN 1 
wWwOCGuUtCOYU 
YVONUOUOU 
VO MmUcUG 


Overt 
VOUGOWUG 
OnwHIONY0 
Chu GUO’ 
OnWII300 
Outs itO 
OvVoYUWOOOU 
GuunucVUyT 
YVOUUNUI 1 
OlOuUTO I 
YQuuUuie 1 
YUuUugGoNn) 
YVUNUUUH 1 
GuowiGG I 
OUNWONU I 
(2O6COulatt) § 
OuUUUUIO 1 
Oc 10 
Orbis, 
Gugcguauad 
OU 1 1ut)IDs) 


-UUUGUOOO 


Logic Analyzer of 4 Channels in Parallel. 
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Figure 3.21 


0000 
0001 
0002 
0003 
CO04 
0005 
00086 
0007 
0008 
0009 
0010 
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0012 
0013 
0014 
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0620 
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0024 
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0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
00490 
Oo0o41 
0042 


0042 
0043 
0044 
OO4S5 
0046 
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0051 
00Se2 
0053 
0054 
0055 
00356 
0057 
0038 
0059 
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O00€e2 
0063 


8 Channels Monitored with the Logic Analizer. 


10110000 
10110011 
10110010 
101100190 
101100140 
00100011 
00100011 
00100011 
00000011 
00000011 
10100011 
00010011 
00000001 
00000001 
00000001 
10011000 
106011001 
10010000 
11111010 
10110010 
10110000 
10110010 


10110010 
1TOVTOT 10 
10110010 
10110010 
00100011 
00100011 
00100011 
000000114 
00000011 
11100011 
10010011 
00000001 
00000001 
06000001 
00000000 
00000001 
10000000 
11100010 
10100010 
11111000 
11111000 
10110060 


104110000 
10110000 
10110000 
10110001 
10141100141 
00110111 
00010011 
00010014 
00000011 
10010011 
00000011 
009000011 
10000011 
11100010 
00000010 
00000000 
11100010 
10100010 
10100000 
111776004 
10110000 
10110000 


00000011 
01110011 
00000001 
00000000 
00010000 
00000009 
00000001 
00000010 
000060000 
11100000 
01100000 
01100001 
009100010 
00111010 
00110011 
00110011 
00110011 
00110011 
00110011 
00010011 
00010011 
00010011 


00010011 
00010001 
00000001 
00010000 
00000000 
11100001 
00000000 
00000010 
01100000 
01100000 
00100001 
60100016 
00110010 
00110010 
001100114 
00110011 
00110011 
00110011 
00010011 
00910011 
00010011 
00010001 


00010001 
00000001 
00010001 
00000000 
41100001 
090000000 
00000000 
00000010 
00000000 
60000000 
00000010 
09010010 
TOTO s1O 
00110010 
00110011 
00110011 
00110011 
00110011 
001100114 
149106014 
00110001 
00100001 
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00000000 
00000000 
00000000 
06000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
O0CO000C00 
00000000 
00000000 
000000060 
00000000 
60600000 
00000600 
00000000 
00000000 
00000000 
00000000 


00000000 
00000000 
060000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000060 
09060000 
00000000 
00000000 
00000000 
60000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 


00000000 
00000000 
090000000 
00000000 
00000000 
00060000 
00000000 
00000000 
00000000 
00000000 
00000000 
09000000 
00000000 
009900000 
00000300 
00000000 
00000000 
00000060 
00000000 
00060009 
00000000 
00000000 


09000000 
00000000 
00000000 
09000000 
00000000 
00006000 
00000000 
00000900 
0000000090 
00000600 
00000000 
O0o000000 
00000000 
00060000 
00000000 
00000000 
00000000 
00600900 
00000000 
00000000 
00000000 
OOG00000 


00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
60000000 
00600000 
00000000 
0900900000 
oo00co0n0 
00000000 
00000000 
00000000 
00000000 
00900000 
00000000 
00006000 
00000000 
oo000000 
00000000 
00000000 


00000000 
00000000 
00000000 
00000000 
0000000600 
00006000 
00000000 
00000000 
00000000 
00000000 
00000000 
000606000 
00000000 
006000060 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 


TABI 
DES KM ese) egret ie ect 


channel column ace probe lid 
chanO ---<=-> second 3 Dis 
chanl «<---> second 4 Di Z 
chan2 ---=-> second a, D9 
chan3 --<-=-> second 8 D8 
chan@ ==) elder 3 DS 
chan5S \«-**-> third 1 D4 
chan6 -e--> third i Dal 
chan7 ----> third 8 DO 


acknowledge delays and the delays between successive word, and byte transmissions, by 
making timing diagrams of 4 and 8 channels in parallel. This, however, is left as a 


suggestion for future research. 


Conclusion 3 


The Links really are able to operate in parallel! 





4. Comparison Between the Four Constructs 


TAS 


EFFECT OF PARALLELISM ON TRANSFER RAGES FOR M0000 iE. 
BILO@ Ks Zee 


| 
| pte, oUt pine SVvrewo hee | WORD SElC# 
Cece procedure procedure 
bytes words 
1 channel 370 Se) 3670 2670 
2 channels 130 TIO S67 0 3670 
3 channels 160 640 3650 3E50 
4 channels 160 | 640 SU52.0 Seo 


| ale de 


ie Values are in kbytes/sec rounded to tenths. 
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Table 8 shows the results obtained for 2, 3 and 4 links transmitting in parallel 
for each of the constructs. 

These results where obtained using the Link Evaluation Program for all the 
constructs, listed on Appendix E, with no special priority for communications, and 
with no other processes being executed on the cpus, besides the transmitter and 
receiver processes. The time measurements were made at the transputer root at the 
BOO! board. It is clear for us that although the BYTE.SLICE and WORD.SLICE 
procedures are not affected for more channels in parallel for this block size, the input 
and output primitives indeed are, but this will be addressed in the next section. It is 
still worthy of mention that several attempts were made to increase the transfer rate of 
the primitives input and output by using different loop sizes, no loops at all, different 
number of bytes, or words after each ? or ! separated by colons but in none of these 


cases a significant improvement was noticed. 


D. MESSAGE SIZE AND CHANNEL PARALLELISM INFLUENCE. 
Once we overcame the first phase of the research, validating the software we were 
using, we moved our attention towards the fourth and fifth research questions: 
e What is the effect of message lengths on the link transfer rates? 


e What is the mutual effect, on the link transfer rates, of more links operating in 
parallel. 


To address these topics, The Link Evaluation Program was designed and 
implemented, using the programming concepts presented on previous sections of this 
chapter. What it does basically is, after the user’s choice of type of construct and 
existence or non of concurrent process running on the CPU of the communicating 
transputers, named “cpumode”, it builds a Table showing the transfer rates for the 16 
different message sizes and 9 different channel parallelism cases, for the chosen option, 
and prompts the user for a new run. Appendix FE presents the program, written in 
Occam, but one doesn’t need to understand the program to grasp the results obtained, 
that will be presented in the following subsections, and in the next chapter. The 
configuration used for this program was the same of Figures 3.14 and 3.17. 

1. How to Read the Tables 

The tables have ten (10) columns as follows: 


¢ BYTES - Shows the number of bytes transmitted for the results obtained in that 
row. 


¢ 1 OUT - Results obtained measuring transmission through only one channel 
from root to transputer 0. 


oy 


e 1 IN - The same as above for reception on the root from transputer 0 


e 2 OUT - Results obtained measuring transmission in parallel through two 
channels from root to transputer 0 and transputer 1. 


® 2IN- Same as above for reception in parallel. 


e 3 OUT - Results obtained measuring transmission in parallel from root to 
transputers 0. | and 2. 





* 3 1N - Same as above for reception in parallel. 


e 4 OUT - Results obtained measuring parallel transmission from root to 
transputers O71) 2 and: 


e 4IN - Same as above for reception in parallel. 


e 4IN/OUT - Results obtained measuring transmission and reception in parallel 
to/from transputers 0, 1, 2 and 3, using all 8 channels from the four links that 
eXist 1n One transputer. 


TABLE 9 


TRANSPUTER LINK TRANSFER RATE 
BYTE SLICE (1) = NO CONCURRENT PROCESS] IU WiBIES sre 


BYTES..1, OD eee! Z OUT 2 ih COU tower) 4 OUT 4 IN 4INOUT 
i 525 616 250 ZO 200 nS ie a fou 98 


2 Zed LZS 4 500 500 400 400 S25 533 196 
= toot ZL oU tag 1000 648 788 650 646 384 
8 Ze. 2811 1570 1582 ea LO 1085 HOg6 (670 
16 Zn oe 2o24 Zar Zaee 1948 ISLS, OZ 1694 1255 
S72 3224 3246 Zed 2300 2482 2544 2330 2398 pleas 
64 3427, 3646 olts S220 2942 3048 Za07 2954 2462 


128 3543 3644 8382 3497 3265 3320 3187 3320 2945 
256 3605 3741 3496 3656 3444 3596 3398 3556 eZzel 
Sle Soap oye 2570 S732 2555 3697 3509 3677 3401 
1024 3650 3754 S620 3741 3604 S7IZ 3575 8702 we SoZ 
1280 3654 3748 3640 3742 3611 278 8587 3698 3529 
2048 Bene 3740 352 3738 Seg 37s 3604 3703 3549 
4096 3662 37 35 3665 S755 3634 S720 Sole 3709 =S576 
8192 3665 SieZ 3668 37a 3645 37a Sog7 E714 3505 
10000 3667 BF 3 3669 3730 3647 By ce 2623 37 lime Shen 
* Values in kbits/sec 


Some tables, though, have three different columns labeled: 
e 1IN/OUT - instead of 1 IN 
e 2IN/OUT - instead of 2 IN 
* 3 IN/OUT instead of Sain 
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In this columns, as the reader may have guessed already, the results presented 
efer to the transputer root transmitting and receiving at the same time through the 
number of links specified. 

For each of the constructs the results are presented in table format and when 
necessary a graphical representation of the table. Each individual number on the tables 
1s the average of 20 sequential runs. The results are in Kbits/sec due to non availability 
of floating point for Proto-Occam and our need for precision. 

2. BYTE SLICE Procedure 

Table 9 and 10 show us the transfer rates obtained for this procedure with 

communication being the only process being executed by the transputers involved. 


Figure 3.22 1s a graphical representation of Table 9 . 


iD CEIG 
TRANSPUTER LINK TRANSFER RATE 
Bae sUICE(2)max® CONCURRENT PROCESS = 10 MBITS/SEC 


BYTES 1 OUT IINOUT 2 OUT 2Z2INOUT 3 OUT 3INOUT 4 OUT 4 IN 4INOUT 
il B25 248 ZOO 166 Loe ZS 161 166 vo 


a Zo 0 500 900 338 400 245 B22 S55 136 
Is) dere) 1 I8 7TT9 Gol 645 437 645 648 334 
3 Ae ise SG 7 O99 PS Son ioe LOIS 689 
in Pe Zio 2208 ele 1S 1458 1701 1714 ae 
a2 B27 Zo=5 Zi OZ 2325 2508 Zoe 2314 2319 ios? 
64 Salo 3067 S116 2850 2988 Zo07 Zeus Zoho 2741 


128 S576 S350 3368 3204 S505 S075 3186 3520 2926 
Zo S620 3491 S522 3417 3476 SoS 3404 S556 S250) 
52 S555 3574 3601 S532 6576 3483 5516 Se62 3404 
1024 3584 Sor? S651 Sooe Shsz2 S506 S5ok S065 3487 
reou 3687 3624 S657 S591 3640 3584 S595 3685 S50 
2048 S692 2656 S672 S537 S653 3603 S617 3689 3543 
4096 3694 3648 Spas 3616 S671 SoZ 3624 3692 S576 
8192 3698 S655 3690 Soro BO Lo SosZ S652 3694 S596 
10000 3699 S600 So9Z 3614 3679 3638 SoZ 3696 3597 
* Values in kbits/sec 


From Table 9 we can notice the overall tendency of input be quicker than 
Output. due to the way the timers are started and stopped by the flags. The flags used 
in this and all following tables were placed from the BOQ3 transputers to the root. 
When we had the flags inverted the values had a tendency to be bigger for the output, 
so one may disregard the difference. For this reason we tried to show most of times the 


4 


values for “in/out” instead for “in”. Most important, however is the effect of message 
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size and channel parallelism reducing the transfer rates sensibly for smaller message 


te 


sizes, but with insignificant effect for message sizes above 256 bytes. 


Sy a ie a ep mas eee ae, as wang se Fe es rc a ce en 


| t 
rE SN imme a 2 eestor 9 ee 


_—_——_ — 


' 

4 t 1 

1 I 1 

J t t 

' t 1 

1 1 t 

t t { 

t I 1 

4 I t 

t j 1 
Soaks so NS esas 

1 1 ' 

{ 1 1 

{ t 1 

t i 1 

t j 1 

1 4 t 

1 t 1 

j j t 

1 j 1 
ee = eee ee 

1 | 1 

| 1 1 

t t 1 

1 ' t 

1 t 1 

1 } ' 

1 t ' 

t t t 

! I | 
= joao 2 aan oo 

4 | J 

1 t 1 

t t j 

' 1 1 

t ( t 

1 t 1 

{ t t 

1 t ! 

1 ! t 
Slee Ei Se dia a eae 

{ 1 4 

| | t 

t 4 | 

1 I j 

1 t t 

1 I 1 

1 I 1 

1 I t 

4 4 i 
Mead as) oxi ham iest eh antes eee 

~ oo = St = 
=f en 
agence am “a ss = . 
hE a oe yee Bearer alae w, 

I 1 | 

' ' 1 

1 t ' 

1 1 | 

! { i 


(Sara 
Cot 


aZ lS J06see0 





Lg ei 4 ' (ei 
1 ioe' 4 1 1 
1 1 t \ ! on 
1 ft 1 t 
Sitcmiae, ot , Ye et 
i ii st : bie t ae as 
ae re! ' ' een 
Raina oo 1 ' Vet ee 
Vy : \ ' yt 9 
bay ' t ! 1 i! 1 4 ' 
os ea : ' aie! cae ad 
a 4 pigs) eam =~ = a = i , 2 ba ' 
Le err ' { 1 Poa t ' 
calle erp mul ' ' i es ee, 
ty ie t 1 1 Paya j \ 
a cael ' ' ie ano 
oe ieee \ t ae Le 
; j a4 t ! ' 4 ’ toe t 
Risidnc Ayer ' \ Llane 
y ' ' \ J cob A 
1 tent I ' 
ore (ees ' 1 1 ee. 1 4 
pity gt opal > em cl por alana al am, 
ps to4 1 ' j I jets \ 
ry yr eee ' t ae aa 
pie i t j 1 os i t 
pea ry ! J J Seea \ ; 
Lie eel ' De anlane 
ty ! 1 ' ' ot 
{ You. 1! J 
(i. ioe : : eee ee 
oe t 1 1 Mt) ' 
ea a ’ j \ \ teens 
ee 7 seas 
i, (i 1! ei 
' rtf 1 1 ' i ft sot ' 
ant 1 ‘ | \ 1 yt I ne 
uy 1 ' 1 (ic paelta ie 
1 or | \ ' 
t j t 1 ' ipeeaeee 
iy (ft ' 1 1 
' \ 1 i-x | 
laa 1 ' ’ ee t 4 P 
aie ‘ere ' \ Ne ta I ; 
tela ie a ' ' iy 4 f.. 
Vy ce 1 oy i (re A 
mae ae, SS ea ——_ = = = Aiaas a 
tee al ' ' Zz ha ye 
1 ie | t ' be a : 1 rr ac Pp 
' borog ; oO) : = 
ee — eee ' 
ate |! 1 s 
1 : toto : : i j 
| aa Ds _ 
tae see lest cain \ Hees t \ 
. ae Bae eee Seas 
ened 
ate oe 1 
eke eam ee 7 ee 1 
Seay ge - \ t 
you he ' PG (oe : 
t 
o rg f | eh ; 
| 14 ' ' 1 ltd 
u j 1 | 1 
1 : \ : \ ot 
' 
UP | | | | 
t, 14 1 1 Lats j \ ' i 
' ' i \ \ at) 


Rp 


ike 


CN393 


NYHOP 
NINGEIS 
LMOWEHOe 
of NINGEHOC 
HENS S. 
NINSHDI 
LQONeEOT 





a 


<Jj t+ 


O}O: 


Or BOZ 


O0I¢ 
CJGS/S Ia aia eee oo 


OD82 


DO0CF 


Figure 3.22 Transputer Link Transfer Rate 
Slice - No Process in Parallel = 10 umbics see 


Byte 


62 


3. WORD SLICE Procedure 





TABLET 


TANS Ze LER we NieirRANSPER RATE - 
pro SLIGE= ~O GONCURRENT PROCESS.- 10 MBITS'SEC 


Saree 1 OUT 1. IN Z,0UT” 2 INS OUles IN 4 OUT 4 IN 4INOUT 


8 EOL Z2ou3 roo 13550 LOGS 1061 o3e 890 540 
16 2580 oe25 Ee 1996 1667 1682 1454 1466 998 
32 3083 Son ft 2541 2588 2284 2300 2077 2250 esZ 
64 Soa S209 2956 S070 2650 Zeto 2647 Z1 20 2256 


iZs 3491 3679 3294 3406 3156 3278 3068 S2u6s 2183S 
256 SoZ 3/36 3492 55392 3401 3522 3339 3485 31438 
Siz Soy ower gp! Soe 3707 3538 3662 3479 3634 8575 
1024 3644 3754 3624 Sf35 e5e0 3694 Gis) 5) 7! gor? 3486 
1230 39048 3739 3638 3/730 3594 S652 S570 35680 35116 
2048 SO5) 3740 3648 3734 Sous 3708 3595 3694 3546 
4096 3002 Siss 3659 3431 $653 3714 3609 3795 S560 
Sig2 3664 S750 Sopo SO 3642 3749 S621 Sk 3586 
10000 3666 S750 3669 S72 3645 Sus S522 3714 3566 
* Values in kbits/sec 


f 
3 eo 1868 666 eit Jos 625 452 on 294 





As seen in Table I1, the results obtained from WORD SLICE are very similar 
to the ones we had got for the BYTE SLICE procedure, so if the reader wants, he or 
she may use the same Figure 3.22 to have a better feeling of what these numbers 
means. All comments made for the BYTE SLICE procedure are valid also for WORD 
SIELCE. 


Conclusion 4 
Message size has a major effect reducing the transfer 
rate for block transfers (BYTE SLICE and WORD SLICE). 


4. Input and Output Primitives 
a. Transmitting and Receiving Bytes 
Table 12 shows us the results using the primitives input and output to 
transmit and receive bytes. As we can see, there is no variation as the number of bytes 
increase. This is due to the fact that each byte is transmitted individually as can be 


seen in Figure 3.12 . We can also notice that there is a significant decrease as more 
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PA BIee 
TRANSPUTER LINK TRANSFER RATE - 
INPUT/OUTPUT (BYTES 1) See CO VCR IKKE Nin Orca 
(10 MBITS/SEC) 


| 
BY TES CU at cy 2 OUT VZe Ss Our say 4 OUT 4 IN 4INOUT 
iL 3760 547 haZ Zee 136 L7S Mays) 147 89 


2 370 436 188 222 FS3 131 156 149 89 
4 Si 492 13Z 232 ioc 133 a 149 eo 
§ 373 480 120 25m isD Lee 156 148 89 
16 S78 SC Dom Zou 152 133 es a 148 eo 
32 374 Sit iu 250 13s 16s Loz 148 og 
64 374 504 Toa Zon he) 1&3 Poe 148 89 
ZS 374 506 tou 227 ise Lee oe 148 eg 
256 374 505 ion Zou Loe Les LS 7 148 8&9 
SZ 374 506 197 cot T2 Les Oe 148 89 
1024 374 506 ign 2am 2 IS Sa, 148 89 
1280 374 506 Fou Zo ioe ies ey 148 89 
2043 374 os Tot Zen 135 res oe 148 Be 
4096 374 Se Lor Zl LoS res ile! 148 Bo 
SLoZ 374 510 12d Zo Loo res LBS 148 e7 
10000 374 510 Lou ZS > Fes oy) 148 89 


* Values in kbits/sec 


channels are transmitting in parallel. We mention again that we tried several loop sizes 
or even no loop at all, with bytes separated by semicolons, but the results we have got 
where never significantly bigger than the ones presented. Table 13 stress the 
comparison when both channels of a same link are operating at the same time, 
transmitting and receiving messages. Note how the results on columns 3, 5, and 7 of 
Table 12 are 50% to 100% bigger than the ones from Table 13. 

b. Transmitting and Receiving Words (Integers) 

Table 14 shows us the results for transmitting integers and we can notice 
again that message size does not affect the transfer rate, but more channels operating 
in parallel do. As we should expect from the previous results presented, this rate is, on 
the average, 4 times larger than the one for transmitting bytes. 

Table 15 shows the comparison when both channels of a same link are 
transmitting and receiving at the same time. Again we confirm that, in terms of link 
performance, worse than having two different links transmitting at the same time, is to 


have the same link transmitting and receiving. 
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ABLE 13 


TRANSPUTER LINK TRANSFER RATE - 
Pee OCTPUT (SY IES 2) - NO CONCURRENT PROCESS - 
(10 MBITS:SEC) 


prec 1 OUT LINOUT 2 OUT 2INOUT 3 OUT 3INOUT 4 OUT 4 IN 4INOUT 


iL 370 256 oo doa io 1 iz 156 147 76 
2 370 252 188 149 153 Liz 15% 149 89 
= S70 Zo5 EZ ere 13 LEZ Toe, 149 89 
3 a3 235 190 150 Loe 113 lise 148 eg 
16 Sr3 235 190 ie Los lis Se 148 89 
32 ans 255 L9¢ 150 155 iis 156 148 89 
64 374 255 iol a ie Lis ie 148 og 
SZ 374 Zo el ru ioe 1s 156 148 89 
64 374 250 aot Toa Poe TS 15, 148 89 
as 374 235 1 ou Se is ake ey 148 eg 
Zo0 374 255 Po te Se ris roe 142 89 
See 374 255 £91: 152 LSS 13 L5G 148 89 
1024 374 255 out Io! iS PhS roe 148 89 
LZoo 374 235 Poa 1s ial Ls Tes oe 148 oo 
2043 374 232 LO 151 iS Eis i 148 eo 
4096 374 Zoo 191 Ios! LoD Lis Ly 148 eg 
eloZ 374 Zo0 £3 foo [55 PS E57 148 89 
10000 374 235 Poa 150 ESS Lele 57 148 89 


* Values in kbits/sec 


Conclusion 5 


More channels in parallel has a great reducing effect 


over the transfer rate for all constructs except block 
transfers (BYTE and WORD SLICE), bigger than 256 bytes. 





This conclusion does not contradict Conclusion 3, but reduces the universe 


in which that is applicable. 
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TABLE 14 


TRANSPUTER LINK TRANSFER RATE - 
INPUT;/OUTPUT (WORDS) =SINOv CONGUE Nile Cee Sor 
(LO. MBITS/SEC) 


BYEEsel CULL a! 2 OUT =2Zei Se hoe! 4 OUT 4 IN 4INOUT 
t 1326 2330 169 1000 643 185 628 640 BU 
1491 2369 163 1000 634 770 Gom 634 3H5 
1484 2290 ou 1000 S35 769 Gag 640 an 
1503 2326 (Ga 1000 639 is 629 640 375 
64 1504 Zoen Ted 1003 640 (02 Sip 642 376 
Ze 1503 Zoe 766 1003 640 TZ Sag 642 Sa 
Zee 156g 2366 oe 1004 641 772 G30 642 376 
a2 15 2383 eg 1004 641 Las 635 642 Sa7 
1024 1S09 2see Ved 1004 641 Tis Gee 642 376 
1280 1506 2384 toad 1004 641 774 655 642 Sta? 
2048 1509 2384 Tog 1004 641 774 635 642 oa 7 
4096 1S 2576 Uae 1004 641 774 Goo 642 Sa 
8192 150 2394 fie 1005 641 774 636 643 Se 
10000" 15ee 2394 TOG 1005 641 774 636 643 S77 


(oF 0 
NO OV 


* Values in kbits/sec 


TABLE 15 
TRANSPUTER PENS IRAINS PE Rei es 
INPUT/OUTPUT (WORDS 2) 02 NG CON CURRE TP ROGESsa 
(10 MBIT S; SEC) 


BYTES 1 OUT JLINOUT 2 OUT ZINOUT 3S OUT SSINOUD 4. OUT 41 ies oun 
4 1428 259 769 645 ese 4776 B25 645 Sig 


8 1481 Jog ioe 634 634 470 BZ 634 Sie 
16 1495 Sos fie 6a, 636 470 620 634 376 
SZ ees I69 765 640 640 471 B22 Bo7 S76 
64 50s 969 767 640 640 472 B22 os] 376 


eae DOS Jeg 767 640 640 472 623 So7 S16 
256 P09 pau 767 640 641 473 624 638 376 
DZ 1509 970 767 641 641 473 624 638 30 
1024 1509 one 767 641 641 473 624 638 Sd 
Zoo roo 370 767 641 641 473 624 Bod Sa 
2048 Poo 270 767 641 641 473 624 638 S77 
4096 roLo g74 767 641 641 473 624 637 378 
Sl Z ee oil or 641 641 473 624 638 Sia 
TOCCG™ 41510 a7 a 107 641 641 473 624 So? 371i 


* Values in kbits/sec 
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IV. THE MUTUAL EFFECTS BETWEEN PROCESSOR AND THE FOUR 


LINKS 


When using the transputer in a multi-transputer configuration, most likely it will 


be necessary in each transputer node, at least one process to route messages between 


transputers, and another to execute some processing task. Our role in this chapter is to 


examine how a process task oriented would affect a routing process. changing the 


transfer rate on the links. Also, we are going to analyze how a routing process handling 


large messages may affect the throughput of a computation bound process. 


A. EFFECT OF CONCURRENT PROCESSES OVER COMMUNICATIONS 


1. Initial Considerations 


This section addresses the sixth and seventh research questions as follows: 


e Can the CPU execute a process in parallel with some or all the links operating? 


e What is the effect of a communication independent process running on the 
CPU, over the transfer rates obtained in a link by another process in this 
transputer? 


To observe this effect with the links working at 10 mbits/sec rate, the same 


Evaluation Program was used, but using different program defined cpu modes. Figure 


4.1 shows the CPL modes made available by the program to the user’s choice. 


On~nOOB WINE O 


No concurrent process in the cpus 

BO003 cpus with sum process concurrently (par) 
all cpus with sum process Sone rey (par) 
BO03 cpus with sum process concurrently (pripar) 

all cpus with sum process concurrently (pripar 

BO003 cpus with array product process concurrently (par) 
all cpus with array product process pene unngot ly par) 
BOO3 cpus with array product process concurrently Eee 
all cpus with array product process concurrently (pripar 


Figure 4.1 CPU modes Available in the Link Evaluation Program. 


Two procedures called “cpubusysum” or “cpubusyprod” would be -running 


concurrently with the transmitter and receiver procedures in one or both 


communicating CPUs according to the CPU mode chosen and with the following 


effects: 
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¢ “cpubusysum” - This procedure would initiate at the start of communications 
and execute sum operations continuously, until the communications were 
finished, with few memory accesses involved. 


¢ “cpubusyprod” - This procedure, equally, would initiate at the start of 
communications and execute array products continuously until communications 
were finished. Now 100 times more memory accesses was necessary. 


Figure 4.2 shows the code to permit this (e. g. transmission), fora WORD 
SLICE construct. Similar code exists for the ether comstructsy Only senaneciasmune 
procedure “wordtransfer” to the applicable one. See Appendix E for more details on 


then). 


SEQ Mo >= ia Ne womC cieeemtnonouieis 
seeword Dulfens= neta ligcwron 
ee = [1 FOR maxwordblock.size] 


EQ 
wbufferO [k = 10000 
wbufferl |k = 20000 
wbuffer2 {k| := 30000 
: wolecer sik = 40000 
ar 
cpumode = !2' 
“wordtransfer (repetition, cpumode, flag, counter) 
cpubus ean (flag, counter) 
cpumode = !'4' 


PRI PAR 7. 
wordtransfer (repetition, cpumode, flag, counter) 
cpus yea (flag, counter) 
cpumode oy 


= ' 
PAR aa 
wordtransfer (eps t meron cpumode, flag, counter) 
PRU USE nod (flag, counter) 
Ccpumode = '8'! 
PRI PAR - 
wordtransfer (repetition, cpumode, flag, counter) 
epubusyprod (flag, counter) 
TRUE 
wordtransfer (repetition, cpumode, flag, counter): 


Figure 4.2. How the Concurrent Processes Were Called. 


2. Process Priority Considerations 
The transputer supports two priority levels built in in hardware: 


1 Priority 0 (High) - processes with this priority are executed always, without 
being interrupted until they finish. They should work only for a short period of 
time because if the sum of time spent by all priority processes is greater than a 
time slice, the low priority processes will not be able to proceed [Ret. 7: section 
2, p.3}. The high priority processes preempt the low priority ones. 
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2 Priority | (Low) - These are executed when no more high priority processes are 
able to proceed, in a time slice fashion of 1 msec for each process. 


In our program, considering the two processes to be executed in the same 
CPU, three situations were exainined. Assuming the processes names are for example 
“communication” and “cpubusy” we have: 


a. both processes under a PAR construct - in this case processes will be time 
sliced, because both are low priority, at every | msec. This case was observed 
by using “cpumode” I, 2, 5 and 6 in the Link Evaluation Program. 


b. both processes under a PRI PAR with communications in high priority - in this 
case communication will always be executed at once. Remember that it took 
31.5 msec for a 15,000 bytes block to be transmitted, and the time slice is 1 
msec, and so the cpubusy will not have a chance to be executed if 


communication is going on, unless the number of bytes transmitted is smaller 
than 473: 


e (475 x 8) / 3,800,000 = 0.001 sec or 1 msec, if we considered a rate of 3.8 
mbits,sec. This cases were observed by using “cpumode” 3,4,7 and 8 in the 
Link Evaluation Program. 


c. both processes under PRI PAR but with the cpubusy process in high - in this 
case the communications never occurred because the “cpubusy” process 
although started together with the communications, should be stopped by a flag 
of that process (by design), that could never come, because the process was 
never being granted CPU time. This is why no mention to this case is made on 
the Link Evaluation Program. 


Again, analyzes were made for the four constructs and the results are 
presented in tables and graphics similar to the ones in the previous chapter. 

Another point to mention is that when placing a concurrent process in only 
one of the communicating CPUs, the B003 transputer was the chosen one, because of 
its higher internal clock. When the BOO] transputer was with the “cpubusy” process 
first, no changes were noticed in the transfer rates as we added a cpubusy process on 
the BOO3 transputers. In the way we did, we could clearly see the two step change. 

3. BYTE SLICE Procedure 
a, Using the PAR Construct 
(1) One Transputer Only (cpumode = 1! or 5). 
In this case Table 16 for “cpumode = 1” and Table 17 for “cpumode 
= 5”, shows us the results, and Figure 4.3 is the graphical representation of Table 16. 
We can observe that when the CPU has a concurrent process running 
with the same priority as the communications process, the transfer rate is reduced from 


10% to 99.5% less of the original values on Table 9 . 
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AE iar 


TRANSPUTER DUN NSOPER RATE- Dy ie sci@re 
PROCEDURE CPUBUSYSUM GONGURREM Ao Sibi Oem 
LOVER se C 


BYTES Out Cer Z OUT see S) olunt Te oe 4 IN OE 


i S S 3 3 

2 a 5 il 3 i 3 7 it 3 

2 5 7 i 7 15 7 LS is 7 

8 Sil TS om 15 Sa 15 31 30 Pe 
ike ol 30 61 eu el 30 ou 61 30 
32 128 61 ee Gal 120 61 120 120 61 
64 Ze 121 230 120 234 120 234 234 Zt 


10000 3386 3100 3380 3083 3370 3065 S222 e255 B05! 
* Values in kbits/sec. 


TAS rea 


TRANSPUTERSLINK TRANSFER RATE BYTE SPICE 
PROCEDURE, CPUBUSY PROD CONCURREN ex i003. 
LONER TTS Sie 


BY TES Our ee 2 OUT Z2tNoOulessoun ce Be 4 IN 4INOUT 


IL S 3 1 3 3 1 
2 7 3 if 5 Mi 3 7 i 3 
o LS 7 Le 7 ie if LS eS Ui 
8 oul 15 Sil Ls oil iL) Si 20 I 
16 61 30 61 30 61 30 au 61 30 
22 ae) ol aa 61 120 ol 120 Zo: Gill 
64 253 12d 235 120 34 120 234 Z35 zal 


1OCU0% 3376 3091 3362 S076 3374 3068 3343 3417 3050 
* Values in kbits/sec. 
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Figure 4.3 Transputer Link Transfer Rate - BYTE SLICE 


Procedure Cpubusysum Concurrent at the B003 - 10 mbits/sec. 


00cy 


This is a great surprise for us because we are only timing the 
communication itself and although we can not prove, it looks like the communication 
process is alive and sharing CPU time with the cpubusy process, instead of being 
inactive while the links communicate, as all the references led us and our predecessors 
to believe: | Nel a spac 

(2) Both Transputers Busy (cpumode= 2 or 6). 


TABLES 


TRANSPUTER LINK TRANSFERSRA Geb Yess IiGre 
PROCEDURE CPUBUSYSUM CONCUR REN ew eer Ss ae 
IOMBITSISEC 


BYTES 1 Our Toe 2 OUT 2INOUT 3 OUT 3INOUT 4 OUT 4 IN 4INOUT 


1 Z iL a i 1 1 : it 

Z 4 Z 2 2 Z Z zZ Z Z 

s 2 4 : ~ 4 4 & t 4 

8 i? g 9 2 g 9 9 9 9 
i ao ike, Ss ave i no a9 i ie 
32 78 So 30 eg 30 ao 39 Si 33 
o4 Ips 78 78 73 78 Te WA 78 ag 
ize Biz oo hog too 156 oe 156 156 155 
Zoc 624 Sez SZ S12 SZ a2 SZ eyez Sin 


SZ 1249 624 624 624 624 624 624 624 623 

1024 2498 1248 1249 1248 1248 1248 1249 1249 1247 
1280 81:20 1550 S50 1567 P56 15 S61 1561 1559 
2048 2498 1665 1665 1665 1666 1666 Hoes Loo 1664 
4096 Ses2 2498 2499 2499 2499 2499 2499 2499 2497 
S92 Saez Zoos Za 50 2856 Ze56 2856 2856 2856 2850 
10000 34387 3050 3050 S0E0 3050 E0510 3050 3050 3049 

* Values in kbits/sec. 


Table 18 and Figure 4.4 need no explanation. The results for 
“cpubusyprod” are not presented because they happen to give us exactly the same 
results for “cpubusysum”, as we saw in the previous subsection. 

One may notice in Table 18 column “] OUT", that the value for 2048 
bytes (2498) is a lot smaller than the previous one (3120), and the effect 1s clearly seen 
in Figure 4.4 . What may be happening is that as the buffer declared on the program 
(buffer0) may have the initial bytes of it in the internal memory of the transputers (2 
kbytes), and when external memory begins to be accessed, the transfer rate goes down, 
or reduce the rate of increase, as we can see on the lower curve of figure 4.4, where all 
the remaining curves coincide and have a brake on the rate of increase at the same 


point. This is what the author thinks is happening but we were not able to prove it. 
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Figure 4.4 Transputer Link Transfer Rate - BYTE SLICE 
Procedure Cpubusysum Concurrent at All CPUs - 10 mbits/sec. 
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b. Using the PRI PAR Construct 







TAB EES 


TRANSPUTER LINK TRANS BER io 1b SiG 
PROCEDURE CPUBUSYSU YI COmree TREN Ie THE B00} iGiii 
Oey Se 


















BYTES 1 QUT JIINOUT 2 OUT Z2INOUT 3 OUT SiNneul 4 cul 4. inN 4 iveu 
1 Zo 248 250 166 194 22 tel 166 96 


2 ZU 500 500 326 400 245 S22 335 Te 
= lols v3 ie Gon 645 487 645 648 384 
8 ZZ Loo Lae? HOS Lats Bot iT Ola 0S 689 
16 2762 2ioe 2208 Eg aba ie 1458 Or 1714 1256 
32 3227 Zase ZO? 2326 Zo03 ZEoe 2314 Zone Tee 
64 SoS SOG 7 Sire Zoa0 2988 2667 Zo13 ZOE S 2471 


Ze Save S350 S20 3204 S305 SOS Soo 32,0 2926 
256 soZe 3491 BoeZ 3417 3476 3358 3404 Sooo 3250 
Se SoBe 3574 S00 1 Soe SOLO 3483 Sole 3662 3404 
1024 3684 SoL7 Sao 1 Soe? Sibioz S066 Soon Saco 3487 
1280 3687 3624 Seo7 cool 3640 3584 SS Saco Sow 
2048 Bis) ol Bla .B6 BO7Z Soo7 Sis Sous Soy ooo9 3543 
4096 3694 36438 Seo5 3616 Soya 3025 3624 SeoZ S576 
Sil Z 3698 365.9 3690 Solg Soyo S682 3652 3694 3098 
cod 3614 3638 Se2s 





* Values in kbits/sec. 


Table 19 and and Figure 4.5 show the results for one concurrent process 
running in the BOO3 transputers, and Table 20 and Figure 4.6 the same for all CPUs 
with concurrent process but in all cases communication having the high priority. 

As we see the figures are even better, on the average, than when no process 
was running concurrently, as seen on Table 9. This is why we believe and suggest that 
processes that handle only communications, as the routers, should be given always high 
priority. 

For each of the possible cases, Table 21 shows us the number of processes 
executed in parallel in each transputer. Although they do not have a valuable absolute 
meaning, they give us a comparative value of the behavior of the CPU in the different 
constructs. The reason for that is in the way the program was made. There are some 
intervals between the several communication sessions and repetitions, were the cpubusy 
process would be able to operate, time sliced with the calculations and output to 


screen, done after each of these sessions. 
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PAB BE 20 


orl TER iN K TRANSFER RAE = BYTE SLICE 
Peo erUeCRE CPUBLSYSsUM CONCURRENT Al ALL CPUS (HIGH) - 
10 MBITS/SEC 


BYTES 1 OUT I1INOUT 2 OUT 2INOUT 3 OUT 3INOUT 4 OUT 4 IN 4INOUT 
1 2 DD 500 coe oo 1 2 Zo5 Ze3 Za 


i 3 S ih 
2 1728 ew 948 526 690 SoS 520 525 Zoe 
4 2288 1540 1436 L036 1204 681 yt 99:2 498 


5 
10000 3663 


* Values in kbits/sec. 





TABLE 21 


Miwon. OF OPERATIONS EXECUTED CONCURRENTLY IN EACH 
Gye ay Ve StiCesu SED 


Transputer b003 Transputcer bOO1 

cpu. sum Gol God cpu. sum Sole cod 
le . | 7 ol ae eae inactive | inactive. 
a | SOne a) ison ee ona 8 
ieeeeR “MS 1.3 | le 3.1. fh inactive | inactive 
Pa hss ai ut cst ee] 


* Values are in millions. 


4. WORD SLICE Procedure 
For the WORD SLICE Procedure, it happens that the results are very similar 
to the ones obtained for the BYTE SLICE Procedure and they will not be repeated 


is 


fee Aah az is SGUUSSdA 


OOcol 0001 O0t 0} 












: S 
eet hs ; eae ! \ py (ee | : 
ye ees ty ' t ' ea Vee ' ' 
cre es oe he) 
' \ \ 
nate ey wt ieee Pe ae 
ea ee ae ee ly ! a, ae! 
i 
re Pane ec, a. 1 t rept ia |! : 
yet a toa) i Co) a ; 
Chines ' 1 t 1 t ct 
ip ces eae lS edo eee aera ea eS = 
Bs 4 fo 1 t t is; ‘ot et 2 r 2 cae roca ca lees ~. r © 
salen: ' i 1 faa eesti at ee eels! ee a as \ \ a 
hi ens 1 i atime st i t ee er ' , I 
(ee 1 1 fede! ' I leno i ' \ 
ane ! 1 ee ip aie iy 1 ! : 
ee ta ae Mts ert sat f 
aa } I \ pahedlianl ; et i el \ 4 ' 
pw ' i oie I : le i ’ ! 
ee ee com aio: a nee | ! 
vc ‘ j ' et oe 1 oe 
ao Re a a ----- aga; ee G so a aeqeep anode be-- tL = —- -=----~ Ss 
a ' \ ie 1 ' te olay lo ' 1 t OQ 
1, i. Ave it oe “ i 
ae wie te Jaan) ae ; * , 4 
1 por t t 1 1 pa we Jer — ' 
hee La ’ eee t / a , : 
2 a. i i ne al eee : eee ' B , i; 1 | : 
1 (a 4 f ' I 
1 1S eee ft i gee ee et 
' t 
Pepe, 2 aoe ee tn ee ee | a ~ 
te 47 e.41° -=- = ie -——<———- = qq" =— -T.-- oe ee em om 5 om ten pia Wt a lin, baer - = ae a a ee oe oe = 
[ora 1 t t Lepe 7 Poe Cs: fee hs 7 © 
Pe ee | ’ he ee . 
' ' et 
ole t { aa ase si ee Ph ee 1 : 
mera! ' ' ates: ae a 1 ! ' 
ele si! ! ’ (json i i 
ai tot \ i: tote ' 
ae ee a it Po pageraeearonge 
» 
ie) aioe f Se eae 4 i rat z * N) 
oe alg atl See 4t,J-LJ4., to Ne =. ie ne = 
coe i NiN&tdo7 2 S 
fi 0 I t ! ‘ 
i i I 1 [ect ieny aa ~ CS a 
Pee ties fey teats nO 4 
a i oul pete iol ln eee 
’ ’ 1 ia oe 
; i. a | 2 INCNBHOZ x 
Cie. ee Ol eae : eo Nile: on 
37-4 \ ~ 3 -i Re S ’ = 4 emer oN 
poe ee oe E- : ial i! ENCNBHOC Vv S 
oa EE inns ee — ‘ fot OG os oe)|0 I face riatiy ee: ae 
—s. <3 - ' 1 | 
7. ee et ee weet i a. ee 4 aE 
, | \ ee ae ' i t i : 
eee a te; tee! aig i| =ZMONGHOT a 
| 1 1 1 ie oe 4 : 
sleet | ' t t i I ' Ki 7G) 
eh : eee ts oi : ee hu ' i GNA 
lone ! I ee ee i \ a cis ae I 1 I 


COCb 


SLUY YS4SNUYL 


1S Suan 
76 


Po 


® 


(0 
Figure 4.5 Transputer Link Transfer Rate - BYTE SLICE 


Procedure Cpubusysum Concurrent at the BOO3(high) - 10 mbits/sec. 
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Figure 4.6 Transputer Link Transfer Rate - BYTE SLICE 
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Procedure Cpubusysum Concurrent at All CPUs(high) - 10 mbits/sec. 


here. The reader may refer to all Tables and Figures described in the last section, just 
remembering that for word transfer the minimum number of bytes is 4, and so, the two 
first rows might be disregarded. 

5. Input and Output Primitives 


a. Transmitting and Receiving Bytes 


TABLE 22 


TRANSPUOTER LINK TRANSFER RATE =1NPGe eel Ped aris) 
PROC CPUBLSYSUM CONCURRENT ~ 103 IDIts SEG 


coumode 1 out lin/out 2 out 2in/out 3 out 3in/out 4 out 4in/out 


1 PAR S 1 2 i 3 1 3 1 
2 PAR Z i 1 L it 1 1 1 
1 PRIPAR 1379 230 0 5o 160 110 155 90 
é PRIFAR 575 350 370 220 225 SO 235 M5 


Values in kbits/sec 


Following a tendency observed before, there were no variations for transfer 
rates with respect to the message size. Table 22 shows us the figures obtained for the 
various priority schemes used. 

These results were the same for the procedure “cpubusyprod”, and for this 
reason are not shown. 


b. Transmitting and Receiving Integers 


TABLE 2 


TRANSPUTER LINK TRANSFER RAQTE* NEE CU laa Gy ORDS) 
PROC CPUBUSY SUNPGENCURREMT - (OaMbI is SEC 


cpumode 1 out lin/out 2 out 2in/out 3 out 3in/out 4 out 4in/out 


1 PAR 15 fi 15 7 iS a Is 7 
Z PAR 9 a 4 s 4 = + a 
Ll PREP ARS Lone 970 765 640 640 470 B25 375 
Z PRIPAR 2345 1568 1450 1040 1200 650 geo 480 


Values in kbits/sec 
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Table 23 shows us the results for transmitting and receiving integers with 
input and output primitives. 
Several conclusions may be drawn from the two tables mentioned above: 
e results for integers are in general four times larger than for bytes. 


e <A process running concurrently does affect the communications if under a PAR 
construct. Results are 50 to 100 times smaller than the ones obtained for no 
COMemmnemt process usine the CPU. See Tabie 13. 


e When running communications under PRI PAR on the B003 transputers, same 
results are obtained as with no other concurrent process. One shall compare 
third row of Table 22 (1 in/out), with Table 13 for bytes and third row of Table 
@eevatn labie 15 for intesers. 


e When running communications in PRI PAR in both transputers the best 
Pommstcterates ale Obtdined eltner for bytes Or mtegers. So the concurrent CPU 
process will not affect the communications. 


It is always good to remember that the cpu load cases examined are 
extreme cases that rarely or never will occur in any application program, but the results 
obtained, undoubtedly, show us a relation between cpu load and performance obtained 
on the links. So, referring back to research question 6, we are not able to affirm now if 
the links can operate in parallel with the processor, but next section will address this 


point again. 


Conclusion 6 


Under a PAR construct, a process working concurrently on 


the CPU, will reduce the transfer rate on the links. 





Under the PRI PAR, it looks like the communication process in high 
priority does not suffer anv dragging, but we have still a doubt of how much can a 
process do when the communications are in PRI PAR and are lengthy. This will be 


addressed in the next section. 


a 


B. THE EFFECT OF THE COMMUNICATIONS OVER CONCURRENT 
PROCESSES 


This section addresses the eighth research question below: 


e “What 1s the effect of the communications on the links, over a process that is 
being executed concurrently on the main processor of the same transputer?” 


1. Initial Considerations 
To observe this we needed to time a fixed length process without any 
communications occurring in the processor in which it was being executed, and time it 
later with communications in parallel through the links. As we mentioned before, in the 
latter case we needed to make sure that only the communications were happening 
concurrently, hopefully in parallel, in order to guarantee that the process being timed 


was not being dragged bv other processes besides communication processes. 


PROC counter (CHAN in,out, VALUE tnumber) = 


-- descript 
== description eit entee ee 


--- Sums up the first 100000 integers and add the transputer 


--- number to the tota 
mee KARR KKK KK RRR KERR RRR RK RRKARE RRR RR KAR KKA RK KARR KA ARERR KKERKE 


DEF maxope = 100000: --- number of operations done 
VAR Cn, tOtau 
VAR startime3, endtime3: 


S28 
ate >= tnumber 
an: 7. Gla 
TIME ? startime3 
sea = [0 FOR maxope] 
otal := total + 1 
TIME. 2 endtime3 
out ! total;startime3;endtime3: 


Figure 4.7 Procedure Counter. 


What was done, then, was to make a simple procedure called “counter” listed 
on Figure 4.7, and place it in a transputer with no other process. For this purpose, a 
transputer in a BOO3 board would be more appropriate, because we now are going to 
time the processor itself and performance could be affected in the BOO1 board by the 
terminal driver, user interface and so forth. It is never repetitive to remember that with 
the links measurements these effects were not so strong because the links have constant 
speed of transmission, the 10 mbits/sec bit rate, independent of the processor internal 


cycle and load. 


50 








3 
3 
2 
13 
0 
(5) 
00, 10, 20, 30 
(2) [7] 
(6) 
LEAVES! 
01,02, 03 4 
11,12,13 
21.22.23} !2 le 
31,32, 33 





Figure 4.8 Configuration for Program Test Linearity (17). 
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The procedure “counter” sums up the first 100,000 integers plus the transputer 
number were it is located after receiving a flag, and send the result, startime and 
endtime out through a channel “out”. 

The harness where we placed this procedure was a program called “Test 
Linearity” that will be described now briefly. This program is listed in Appendix F and 
includes the main procedures, Host Proc, Route, and Counter, that are separate 
compilation (SC) and are placed in different transputers. The configuration used for 
this program is shown on Figure 4.8. The procedure Host Proc is the user, keyboard 
and screen interfaces, and it is placed in transputer root. The procedure Route is placed 
in transputers 00, 10, 20 and 30, and executes in parallel the routing procedure and the 
counter. The remaning transputers (12) are all executing onlv the counter procedure. If 
we look close to the topology of the processors on Figure 4.8, we can see that we have 
a tree structure were the leaves are only executing counters, the second level nodes are 
the routers and the root is the host procedure (hostproc), Figure 4.9 lists Procedure 
route. 

2. Results Obtained 

The first measurement done, was the time to execute “counter” and we 
obtained 130 msec., using the tick.to.time routine to convert the tick values. This value 
was obtained in all 16 counters either alone in a transputer leaf or inside the routers, 
meaning that the present level of communications were not affecting the concurrent 
process on the routers CPU! 

Then, to enforce a continuous communication, we placed in each router, 
besides the flags, and in parallel with the counter, three block transfers to the three 
leaves of each router using the BYTE.SLICE procedure with blocks of 50,000 first, and 
then varying from 70,000 to 256 bytes. If we recall from Chapter 3 this would assure us 
at least 105 and 147 msec, respectively for 50,000 and 70,000 bytes, of continuous 
communication, considering the rate of 3.8 mbits/sec. In fact, we also measured in this 
new configuration the time to execute the communication process alone, and it took 
respectively 103 and 144 msec, so implying a transfer rate of 3.88 mbits/sec for the 
three channels transmitting in parallel. This result a little bit higher than the ones 
observed in the last chapter is explained for two reasons: 


1 The use of 15 Mhz transputers with shorter processes inside and so permitting 
most use of internal memory. 


2 The bigger external memory permitting use of bigger block transfers. 


The transfers were then timed in two modes: 
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PROC route(CHAN messagein,messageout,routetol,routeto2,routeto3, 
chofroml,echofrom2,echofrom3,VALUE k)= 


DEF 1=4 ; -~- number of counter procedures 
VAR msg : ==> flag 
VaR beste s (erie 
VAR startime[i],endtime[i]: --- timers 
CHAN softin,softout: --- soft aera declared for 
——— COnmUnlecadtlon with procedure counter. 
SSC re ROG GOunEer 
--- This procedure counter is listed in Figure 4.7 


PAR 
GCOlMtema ScOneOUL, SOLt in, k 
== rolting process itself 
SEQ 


messagein ? msg 
SEO 
PAR 
route com esc 
routetoZ ! msg 
routeto3 ! msg 
softout ! msg 
PAR 
echofroml ? results-0-;startime-0-;endtime-0- 
echofrom2 ? results-l-;startime-l-; -endtime-1- 
echofrom3 ? results- -2-; startime-2-;endtime-2- 
softin ? results-3-; startime-3-;endtime-3- 
25 eat to the root results and timing 
SEQ i = FOR 4] — 
eee ! Peers aetna (nat ine (i). 





Pitre 4 = rrocedure Route. 


e 3 chanout, with three simultaneous BYTE.SLICE transmissions to the counters 
in different transputers, 


e 3 in/out, with six simultaneous transfers (3 input and 3 output) to/from the 
counters in different transputers. 


Note from the procedure route code in Figure 4.9 that a flag was sent to each 
transputer to make sure they were ready for the BYTE.SLICE transfer, and then 
another flag was sent to the local counter procedure and so do the best possible for the 
communications begin together with the local counter procedure. 

As we can see from Table 26, for message blocks up to 520 bytes, no effect 
was noticed on the procedure counter! At this point no further investigation has 
occurred and two speculations could account for the observed data: 


1 May be after 520 bytes long, the arrays being transmitted, begin to access 
external memory of the transputers. If this was the reason, the increase of time 
should be more proportional than the abrupt increase of 35% more in time 
(46/130) with an increase of 4.6% in the number of bytes transmitted (24/520) 
as shown in Table 25. 
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TABLE 26 
TIM MENG OF PR@GEDY RE COUNTER 


a. Message size 50,000 bytes 
- time to execute communications only : 103 msec 
NO -COmmuni Ca tiem s 6.2. ee ee 130 msec 
Zz. With 3 chaneuc. .2eeeeeee. 186 msec 
So WEEN 3 in Cut 6 ee eee 195 msec 


b. Variable message size with 3 chanout 


Timing of procedure Gece =| MSS) 


- time EO CXGCULEe Procedtine #eolimee ha wimeae 


im ene weweer in the leaves 
aoe Bee ea both cases 
70000 elo. 190 Wer 
50000 foo 186 131 
1O000 178 Lis LES 

1000 176 126 ee) 
544 aS re io 
5238 eS lo 136 
5 ZO 1830, 130 130 

<5 139 139 6 





2 The counter is being timed out, when communication takes more than | msec 
to finish - this looks more reasonable in the sense that if the time slice instead 
of 1 msec, that corresponds to 485 bytes to be transmitted at a 3.88 mbits/sec 
rate, is 1.07msec this would give us a transfer of 520 bytes in the period of a 
time slice because : 


e (520 x 8) / 3,880,000= 0.00107 sec or 1.07 msec 

On the other hand, if we compare the total execution time of 195 msec for the 
worst case observed (process being executed concurrently), with the sum of the 
individual times necessary for counter or communications to accomplish its task , 233 
msec, (See in Table 25 a.), we see a mismatch of 38 msec, when the two processes 
might be overlapping in time. 

The great surprise, although was the unpredictable effect on the transputer 
leaves where we have the counter process executed sequentially after the 
communications and the timing only begins after the communications are over (Table 
25 b, last column). We have no reasonable explanation for that. 

So, as we see, no definitive conclusion of how the scheduling of the routing 
process and the counter process is happening, but from the times obtained, there has 
to exist some overlapping, but not total, between the counter and the routing processes 


in the router transputers. The results were consistent on the four routers. 
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Conclusion 7 
The communication indeed affects the process being executed 
in the CPU, for messages greater than a threshold size. 
For our example this value was 520 bytes or bigger. 
Bellow this message size, communications had no effect 


over the process being executed on the CPU. 


This first conclusion sure lead us to do a complete case study, on the subject 
matter varving the counter size, the message size and using another typical process 
instead of the simple counter, and observe the effects. It could be done, in a similar 
way that was done for the links, but time did not permit this to be included in this 


thesis, and is another suggestion for follow-on research. 


C. DOES THE TRANSPUTER ACHIEVE LINEAR PERFORMANCE 
IMPROVEMENTS? 


We could see in Chapter 3 that the four links in one transputer, in some cases 
gives us linear performance improvements, because the transfer rate per channel is kept 
constant while we increase the number of channels in parallel. The reader may recall 
Tables 9, 10 and 11 for 512 bytes or larger. 

If we now look into the process performance, turning back to the Test Linearity 
program, We can say that for this program, each counter took 130 mec to execute and 
timing from the host process on transputer root we have got a total execution time of 
133 msec since the first flag left channels hostoutO to 3, up to the last result was 
Received back. 

A simple test was made mapping all processes assigned to a BOO3 board with four 
transputers, to only one transputer. In this way, one route process plus 3 counters 
would run in parallel in only one CPU, the former routers. See Figure 4.10 that shows 
the new procedure route5 that accomplish that. The configuration now was the same 
one depicted on Figure 3.14, with a different process placement shown on the program 
Structure on Figure 4.11. The results obtained are listed on Table 25. 

As we see to have a rigorous linear increase of performance we should have: 

lmeerctleaci counter time: 
e 517.5 (average)/ 4 = 129.44 , and what we had got was 130 msec each! 
2 on the total execution time 


© 534/4 = 133.5, and what we had got was 133 msec! 
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PROC route5 (CHAN messagein,messageout,VALUE i)= 


PROC route(CHAN ne ein,messageout,routetol,routetoZ2,routeto3, 
ofroml,ec ofrom2, echofrom3, VALUE k)= 
=—- (sep ae acct i “the same of Figure 4.9 and is not repeated. 


PROC counter (CHAN in, out, VALUE tnumber)= . 
---This procedure is the same of Figure 4.7 and is not repeated. 


DEF totiinkseoe2. =--constant fommseit channel definition. 
CHAN pipe[totlinks]: ---soft channel definitions 
PAR 


route (messagein,messageout ,pipe[9+(6*1i) ipe[11+(6*1) 
a eee eee pipe (Be (exi dy), Bee e rok Poke ee Tek4) 1, 1) 


counter (pipe[9+(6*1)],pipe[8+(6*1)],((10*i)+1)) 
counter (pipe[11+(6*i)],pipe[10+(6%*1)],((10*i)+2)) 
counter (pipe[13+(6*i)],pipe[12+(6*1)],((10%*i)+3) ) 


Figure 4.10 Procedure RouteS. 


Another version of the Test Linearity Program was made and mapped to only 
one transputer T414 in a BOO3 board. The time for execution was then 2.3 seconds! A 
last version made for the OPS system running on the VAX VMS run at best in 8.8 


seconds! 


Conclusion 8 


With normal communication load, linear increase 


of performance with more processors may be achieved! 


The routing process does not drag the processor! 
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PROGRAM testlinearity 
KARAKKKKKKKAKKKKKKAKKARKKAKKAKAKAKAKAKKAKKKRKA KKK KAREKKRERKKKREKKAKK 


x Title : Test Performance Linearity 

*x Version 

x Mod: 0 

= Author : Jose Vanni Filho, Ledr., Brazilian Navy 
Pode: dune, 5th aies7 

cal Programming Language = OCCA 1 

x Compiler : IMS D 600 - TDS 

* Brief Description : This version of test linearity 
* mapped into 5 transputers, shows us the increase 

a in time to execute the same processes of version 2 
* with the reduction of the number of processors, by 
x 


a GaGceon Of 2. 
KAAAKAARKRK RK RARKERKRARERAKAKAKARARKK AE RE RE KRK KAA KR AK RRA RAK 


eeseeeee nOstproc (CHAN AjB,C,D EF ,G,H) 


Sime procedure is the same included in version 2 of the Test 
--- Performance Linearity program in Appendix F and is not repeated. 


SC PROC route5 (CHAN messagein,messageout, VALUE i) 
Jails DLOoCcedure aa uthe same of Figure 4. 10 and is not repeated. 


configuration 
dials definitions 


DEF linkOin = 4 
DEF linkOout = 0 


DEF linklin = 5 
DEF linklout = 
DEF link2in = 6 
Der linkZout é 


a 
ot 


DEF link3in = : 
DEF link30ut = 3 


DEF root = 100: 
DEF totlinks = 32: 
CHAN pipe[totlinks]: 


PEACED PAR 


PRgchasun root 
-- link placements and process assignment 


PLACE pipe[0O] AT linkOin 
Pench papeil| AT linkQout 
PLAGE @pipel2| Ab@enklin 
miece pipe|3} AT linklout 
PLACE pipe{4| AT link2in 
Frece pipe(5| AT PinkZout 
PEACE spape iol. AT Tink3in 
PLACE pipe|74 AT link3out 


hostproc erp e Ot perc ipe [ até tps Etsy) 
pipe|1],pipe ; pipe pipe 


PLACED PAR j = LO FOR 4] 
PROCESSOR 10% | 
-- link Perera] AL and Bees s assignment 
PLACE pipe aoa AT linkOout : 
PLACE pipe{ (2*3)+1] AT linkOin 


route5 (pipe[(2*j)+1],pipe[2*j],j) 


Figure 4.11 Structure of Program Test Linearity (5). 
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TAD PEs 


COMPARING COUNTER EXECUTION TIME IN 4 AND 16 
TRANSPUTERS NETWORK 


16 transputers NR 4 transputers NR 
counter 00 | 130 msec | Olan a 520 msec 00. 
counter 01 130 msec Ol 518 msec 00 
counter 02. #130 msec 02 517 msec 00 
counter 03. = = 130 msec 03 515 msec —00 
counter 10 130 msec 10 | 520 msec 10° 
counter 11 | 130 msec ll 519 msec 10 
counter 12. 130 msec 12 517 msec 10 
counter 13. 130 msec 13 515 msec —10 
counter 20 130 msec 0. 520 msec 20° 
counter 21 130 msec Pon 519 msec 20 
counter 22.” 130 msec oe 517 msec 20 
counter 23. 130 msec Wee 515 msec 20 
counter 30. | 130 msec son 520 msec 30. 
counter 31 130 msec 31 519 msec 30 
counter 32. 130 msec ace 517 msec —- 30 
counter 33. 130 msec scan 515 msec 30 
Total Execution 133 msec 534 msec 
(timed on bOO1) 
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V. CONCLUSION 


When this research begun, in October 1986, we had a new machine, working with 
a language that we did not know, and using a concept that still today is considered 
hard to grasp and to work with : Concurrency and Parallelism. After working for eight 
months with the transputer, the first conclusion that come up is: 


e Concurrency and Parallelism are not difficult concepts to understand at all, 
using the Transputer and the Occam Programming Language. 


In this first phase of the research, the evaluation of the Transputer hardware, 
several significant conclusions were reached and they are summarized in the following 
paragraphs, that were obtained from the body of the thesis. They give us a good first 
idea of the real potential and capabilities of the Transputer when programmed in 
Proto-Occam. 

The bit rate in the links is switchable between 10 mbits/sec and 20 mbits/sec. 
When operating at 10 mbits’sec rate, the data rate was at best 3.8 mbits/sec or 450 
kbytes’sec, per channel. So, the eight links will be able, in the best case, to exchange 
3.8 mbytes of data in one second, between two adjacent transputers, because the links 
are reallv able to operate in parallel. We shall remember that to obtain this results, we 
need to use the BYTE SLICE or WORD SLICE constructs, with messages larger than 
256 bytes. Equally, when switched to 20 mbits /sec rate the maximum data rate 
obtained was 6.1 mbits/sec. 

When a computation bound process is running in the cpu, with the same priority 
as the routing process, it will reduce the transfer rate on the links for any construct, at 
least 8% for one channel operating, and 21% for any other number of channels. These 
results were observed for message size 10,000 bytes or smaller. 

On the other hand, if we give high priority to the communications, the cpu 
process will be executed in the same way, and the communications will keep the 
previously obtained rate of 3.8 mbits/sec, so this 1s strongly recommended. 

Communications in the links will reduce the performance of a process being 
executed in the same CPU, when message sizes overcome a threshold size, depending 
on the process type. For our observed case this value was 520 bytes. For larger 
message sizes, the maximum reduction in performance for the computation bound 


process was 50% in the worst case (Six channels operating in parallel). 
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The transputer is able to increase throughput linearly with the increase of the 
number of transputers in which the process 1s executed. 

Although very promising, these conclusions are not complete and here follows 
some suggestions for follow-on work in the evaluation: 


1 To investigate the usage of the internal memory by the processor, specially if 
priority 1s given for data or program execution code, to be placed in internal 
meniory. 


2 To investigate how the scheduler handles long communication processes that 
are consuming more than one time slice. 


3. To use a Logic State Analizer capable to sample in a clock rate of 50 or 100 
mhz, to more precisely measure the time delays involved in the receipt of a 
frame and dispatch of respective acknowledge. 


4 To time the amount of time needed for an array of variable size to be 
transmitted through several transputers to a non adjacent destination. 


5 To use the Link Evaluation Program with greater message sizes. This would 
imply in using BOO3 boards that have 256 kbytes available, per transputer, 
instead of the 64 kbytes available at the BOO1 board, or a replacing the BOO] 
board by another board with larger external memory. 


6 To make a thorough study of the effect of link operation over a computation 
bound process. 


7 To benchmark a network of transputers configured in a hypercube with the 
commercially available hypercube computers, like the Intel IPCS-VX, using the 
Operating System presented by Cordeiro [Ref. 6]. 


Another suggestion for research is the development of real-time application 
programs to observe the behavior of the machine under normal work load situations. 

It is important to mention at this point that, as advertised, we could indeed use 
transputers with different internal clock cycle, communicating with each other with no 
problems at all. 

Equally important is to remember that in all results obtained in this research, we 
were using bytes or integers, with no floating point operations. So one other 
recommended topic for investigation, is the link and processor performance evaluation 
for floating point data. This could be done in two ways: 

1 By using software floating point available in Occam 2, or 


2 By using the hardware floating point that will be available with the T-800 
transputer. 


If we could state, our impression about the transputer, the small size, the 


simplicity and the speed are the things that really stood out. 
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Asmamlindlesuseestion, tO enlarge the research horizons at the NPS, we 
recommend the replacement, when possible, of the the BOO! board, interfacing with the 
VAX, that turned out to be a bottleneck for our 160 MIPS capable Transputer System, 
either in processing speed, or in memory availability. 

Occam is a very easy language to use, the fold editor is very powerful and 
friendly, and the channels are very good elements for synchronizing processes. But as 
soon as the Ada compiler becomes available, the research should follow that way and 
then, comparison with the previously obtained performances, will be helpful in judging 


the applicability of the Transputer in military real-time systems. 
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APPENDIX A 
LEARNING SEQUENCE 


a. How to Log in 

The first thing one will need is an account on the VAX-VMS to use 
OCCAM. 

There 1s a group account username “OCCAM” , and through the CS. 
Department staff one can get a sub-account to it. 

Once a person gets a sub-account, one shall have a password and a login 
name (normally the last name). With this, one should go to a terminal VT 100 or VT 
220 (no other terminal will work !), log in, and as soon as the “S” prompt appears, the 
VAX/VMS System is ready to begin. 

If by anv chance, the person already has one account in the VAX/VMS 
system, what he/she may want to do is to work from his own account. That will be 
possible, but as soon as the S appears and before one tries to use any of the OPS 
or/and TDS commands one should type either: 

® opssetup --> to use the OPS system, or 
e tdssetup --> to use the TDS system. 

These commands are already included in the login.com file of the OCCAM 
account and it is a good idea for one to include them in one’s login.com file too. 
Another thing one may need to do 1s to move to the “dua0:;OCCAM]” directory to 
copy files and libraries already created and that, certainly will be useful and save time 
for anybody. 

b. Learning Sequence 
i. Silene 
The first thing one needs to know is how to use the VMS Operating 
System. One good choice is to run the online tutorial VMSCAI and/or get a VMS 
tutorial from the C. S. Department [Ref. 22]. If the person is completely unexperienced 
it Will take two sessions of two hours each, to get a good feeling for tt. 

2. Step 2 

When one feels comfortable using the VAX/VMS, the next step is to get 
acquainted with the fold editor. This is a very powerful editor but most likely it will be 
new for anybody, and if one needs more information on it, he/she should refer to the 


Occam Programming System Manual [Ref. 13: section 2]. 
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To execute the tutorial : 


e copy from the OCCAM account the file “OPSTUTOR.DOC” using the 
following commands at the S prompt: 


e “set default duaQ:Occam ” (this will move you to OCCAM directory) 
e “opscopy opstutor.doc [.vour_directory]” 
e “set default [.vour directory] (to move back) 
e type: ~ ops opstutor.doc” at the $§ prompt in your directory 
This will open the opstutor.doc file and will appear on the screen on the 
upper left 


“Press -ENTER FOLD- to start session” 
we Orsrl TOR. DOG 


At this point one should press the key “0” and while pressing it press also 
key “7” (both Keys are on the numerical keypad on the right side of your keyboard. 
This is the ENTER FOLD command. From here on just follow the on screen 
instructions. 

It is likely one will need about two hours for the first time, but as one 
keeps using the editor he/she will find it most easy and powerful. It is a must to have 
a card with a xerox copy off the keypad description codes. See Figure A.1!!. 

Be Siep 3 

Learning the Occam language is the next thing to do. 

One may even begin reading the Occam Programming Manual 
[Ref. 13,: section 3] or Pountain’s book [Ref. 12] early in the learning process, if 
desired. If the reader knows any other structured language such as PASCAL, ADA, or 
C it will be most easy. It is very important to get a good grasp of the channel concept! 

4. Step 4 

At this point it would be good one know some thing about the transputer 
hardware, and architecture. The Transputer Reference Manual is the reference, but the 
technical notes from INMOS or the existing theses will also help. 

ye step 5 

At this point one have a choice of learning one of the three systems 
available at the NPS: OPS, TDS for the VAX, or TDS for the PC. They are a little bit 
different and a good choice for the beginner will be the OPS. This will enable the 


person to use the Occam language for create concurrent programs, that will be 
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Figure A.l Keypad for Using the Fold Editor. 


compiled, linked, debugged and run on the VAX. The OPS Manual is the main 
reference for it. 


6. Step 6 
After that then, depending on which system you will work you should learn 
the TDS for the VAX or for the PC. The reference manuals respectively are the main 
reference, but the Theses by Cordeiro or Vanni present several hints and suggestions 
that may help. With respect to Occam the only different skill one will need is how to 


make configurations. Again both theses will help. 
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APPENDIX B 
OPS TUTORIAL 


Ll. Introduction 


This appendix will describe briefly how to use the OPS system, resident on 


the VAX’/VMS, to write a program, compile, link and execute it. It will not be a 


complete description of the system and it assumes the reader already knows how to use 
the Fold Editor and the VMS Operating System in the VAX, and had alreadv been 


exposed to the Occam Language. The main reference is the Occam Programming 


Svstem Manual. 


2. The Existing File Types: 
In OPS there are several user file types identified by the file extension: 


“ops - these are source files, folded, that may be edited, and once in the 
program format, may be compiled. These can not be printed. 


“lis” - these are listing files that may be used as a VMS file for any purpose. 
The copy, type, print commands on this operating system work with no 
problem. 


“,.obj” - these are object files that were already compiled. Thev may be linked to 
Make an .¢xe file. They are not printable. 


“exe” - these are executable files that were compiled and linked already. They 
also can not be printed. 


3. To Start the System 
Once one is logged on the VAX/VMS on a terminal VT-100 or VT220, the 


first command to type 1s: 


opssetup - this will enable all the following commands used in the OPS to be 
recognized by the VMS Operating System, through the “ops kernel” (opskrnl) 
resident on the Systems Directory. 


4. To Open a File 


Type: 
ops “filename” - this command may be applied to any ”.ops” file and will make 
the file available to be edited with the fold editor. Every time one exit the 
outermost fold, a new version will be created on the VMS file System. Keep 
track. 


5. To Make a Procedure or a Program 

The procedures and programs have a similar structure as in Pascal or Ada. 
After the procedure name with parameters follows the constant, variables and channels 
declarations and procedures defined onlv inside that procedure (subprocedures), and at 
last the main program that may begin with an WHILE, IF, SEQ, ALT, PAR, PRI 
PAR, a replicator, and so on and terminate with a colon(:). The best is to make all 
procedures with separate compilation (SC) capability, and for that we should apply the 
utihty MAKE SC PROC to the procedure fold line. 

The program has no parameters and no colon at the end, but The strictire 
is the same as described for the procedure. It 1s important to say that in Occam one is 
not obliged to declare all constant, variables and channels at the beginning of the 
procedure. It may be done before any process. A process begins with any of the above 
mentioned constructs. The best way to learn is to look at ready programs so we will 
stop this section here. When we use the utility MAKE PROGRAM the name program 
will automatically appear in front of your program name. 

The global definitions and library are very useful to easily make programs, 
and it is a good idea to put them in any program. 

6. To Compile a Ready Program or Procedure 

Any PROGRAM or SC PROC may be compiled separately, as long as the 
utilities “MAKE PROGRAM” or “MAKE SC PROC’ respectively, were applied to 
them and no error message occurred. To execute the compilation, the cursor has to be 
in a folded line, with a PROGRAM or and SC PROC inside, and then the user should 
execute the utility COMPILE. The system will prompt for the object file name and it 1s 
a good idea to use the same name of the source file. 

7. Debugging a Program During Compilation 

The compiler is quick and every time one gets a compilation error, the error 
description appears on top of the screen and the cursor is placed on the line where the 
error occurred, or one before. The System will be in edit mode and the error may be 
corrected at once. After correcting and exiting the fold, one will be ready to compile 
again, neatly and cleanly. 

8. To Link a Program 

After the object file was created with the compilation, one has to leave the 

Fold Editor and at the VMS prompt (S$) type: 


e link/debug opskrnl, program name 
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The debug is optional and we did not use it too much, but we can say it 
runs and permits one to trace a program execution. After the linking the .exe file will 
be created, and one is then able to run the program. 


9, To Runa Program 
After the ink was done successfully, one should type after the $ prompt: 
e run/ debug program_name 
Again the debug is optional and after this command the program will be 
running on the VAX. If logical errors occur, the two options are either to use the VAX 


On-line debugger or get back to the source code (the .ops file) and place some output 


Meusereem ( sereen ! var ). 


Special Functions Utilities 

FUNC nh HELP FUNC 1 . CHECK 

FUNC t FOLDINFO FUNC 2 : COMPILE 

FUNC s SETUP FUNC 3° MAKE PROGRAM 

FUNC 4. MAKE SC PROC 

_ DESCRIPTOR INFO 

: LOCATE ERROR 
ESTIMATE 

_ SEARCH 
REPLACE 


Key 


This function 1s oblsined FUNC 
by pr e.sing F UNL and Inen FUNC 


ine a ey 
FUNC 
FUNC 
FUNC 


FUNC 


I 
2 
> 
4 
5 
6 
7 
8 
2 
0 


LIST 


This functiun 1s cdlainad 


by pr essing Ine bey 





Picure BeleeOrs Utihties. 


10. To List a Program 
There are two ways to do that: 
1 The first one is under VMS, one shall use the OPS command: 
e opshist filename.ops filename.ls 
This will create a list file on filename.lis to be printed on the VAX on-line 
printer. Be careful here! If one forgets to put a “filename.lis” in the command, 


the source file will be transformed in a list file with the wrong termination. And 


a 


worse, if by chance one purges the director, allethemiold ssuuciure sine 
programer created will be destroyed and will have to be redone, if one needs to 


compile the program again. 


ty 


The second one is under OPS; one shall use utility LIST. This may be applied 
to any fold inside the program and the user will be prompted for a file name.lis . 


It is important to mention that every time one lists a file, the folds will be 
opened, and appear sequentially. It is not very easy for a begginer to follow a printout 
of the file. The fold editor permits us a much better block view of what the program 
looks like. So most likely if one has a very hard bug to solve, debugging from the 
Screen wi leWetedsict: 

11. Final Remmarks 

There are other commands and utilities that after a while one may need to 
use, but for the verv beginning, the ones listed here will suffice. Figure B.1 show all 
OPS Utilities and how to call them, on a VT-100 Terminal. The FUNC means that one 
should press the 0 Key at the numerical keypad and the number on the keyboard (NOT 


PF KEYS). Figure &firstpro presents a simple program as an example. 
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SoeeROGRAM hellol 
== aeselio] 
mann KRAKKKAKKARKAAKAKKAAAKKAKKAARAKARAAARKKAARKAAARAARKAARARKKRKAARKKR 
--- This 1s_a the first program in OPS to be seen by a beginner. 
--- This fold contains a simple occam program which says hello. 
ech EMeNmMesSsage appears Ol the Screen you Cal type any 
peed teacter. === [e will be echoed on the screen (no automatic 
--- line feed or carriage --- return. 


=== When you Bype 0 ene program ends. 
mae RAKAKKAAKKAARAAKAKAKKRAKAAKKARAAKRKRAARA RRR RK RA RARAAKA KARA KARA 


-- declarations 

DEF hello = "hello! press 0 to stop a3 
DEF EndBuffer = -3: == system s Cohscan 
CHAN Screen AT 1: --- system's channel 
CHAN Keyboard AT 2: =--- system's channel 
Vee ich: 

VAR going: == BOGMecn 


a main program 
SEQ i = [1 FOR hello[BYTE 0]] 
Sepeen |! ehello[BYTE 1] 
Screen ! EndBuffer --- EndBuffer needed when outputting strings 
Olng := TRUE 
Vitis going 
Se 
eyboard ? ch 
Semecche ech bndburfer 


F 
ch = #30 --~ Hex value for ASCII 0 
going := FALSE 
E 


Figure B.2 First Program in OPS. 
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APPENDIX C 
TDS TUTORIAL 


I. Introduction 


This appendix will describe briefly how to use the TDS system, resident on 


the VAX/VMS at the NPS, to edit, compile, down load and execute an Occam 


program. It will not be a complete description of the system and it assumes the reader 


already knows how to use the OPS System, the Fold Editor and the VMS Operating 


System in the VAX. The main reference is the Transputer Development System 
Manual, D-600. 


2. The Existing File Types: 
In TDS there are several user file types identified by the file extension: 


“tds” - these are source files, folded, that may be edited, and once in the 
program format, may be compiled. These can not be printed. 


“Ist” - these are listing files that may be used as a VMS file for any purpose. 
The copy, type, print commands on this operating system work with no 


problem. Originally the extension was ”.lis”, but we suggest the programmer to 
use other termination in order to identify the file. 


“tcd” - these are “transputer code” files originated from an extraction after a 
compilation was successfully completed. They are not printable. 


“.cde” - these are non- executable files that were compiled and extracted already. 
They will exist when the programmer uses closed files inside his program, and 
contain the code for a file. They are not printable. 


".dsc” - these are descriptor files and will exist only when the programmer used 
closed files in his programs. They are not printable. 


3. To Start the System 
Once one is logged on the VAX/VMS on a terminal VI-100 or V1I220, the 


first command to type 1s: 


tdssetup - this will enable all the following commands used in the TDS to be 
recognized by the VMS Operating System, through the “ops kernel” (opskrnl) 
resident on the Systems Directory. 


4. To Opena File 
Type: 


tds “filename” - this command is to be applied to any ”.tds” file and will make 
the file available for editing with the fold editor. Every time you exit the 
outermost fold, a new version will be created on the VMS file System. Keep 
track. 
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5. To Make a Procedure or a Program 

The procedures and programs have a similar structure as in OPS, so thev 
will not be repeated here. 

The global_definitions and library are very useful to make programs easily, 
and it is a good idea to put them in anv program.There are two global_definitions, one 
for each of the systems identified by the extension. Be careful to imbed in your 
program the “global _def.tds.” 

There are two different things from OPS in a program for the TDS: 


1 The first is: to see any result on the screen, one must include inside the program 
the terminal driver, provided by INMOS, for the board that one is using (BOOL, 
BO02 or BOOS), 


2 The second is the need for a configuration. The configuration basically gives 
names to the physical channels and places in each transputer the process to be 
executed there. Rather than try to explain here, the best is to browse some of 
the several configurations existing in the Theses by Vanni or Cordeiro, or in the 
programs already existing in the Group account Occam. 


6. To Compile a Ready Program or Procedure 
Any PROGRAM or SC PROC may be compiled separately as long as the 
uulities “MAKE PROGRAM” or “MAKE SC PROC” respectively, are applied to them 
@mcenerate NO error message. lo execute the compilation, the cursor has to be in a 
folded line with a PROGRAM or and SC PROC inside, and apply the utility 
COMPILE. There will be no prompt at this time, except for the compilation 
parameters. If the program has complicated nesting of PAR and ALT constructs, use 
Seek = FALSE. 
7. Debugging a Program During Compilation 
The compiler is quick and every time one gets a compilation error, the error 
description appears on top of the screen and the cursor will be placed on the line where 
the error occurred, or one before, in edit mode and the error may be corrected at once. 
After corrected, exit the fold and one will be ready to compile again. Neat and Clean. 
8. To Extract the Code to Be Executed in the Transputer 
The compilation will create several folds inside the .ops program containing 
Uvemuescriptar atid the code to be executed. To extract the code execute utility ~ 
EXTRACT TO FILE ”. At this point one will be prompted for a filename to extract, 


and we strongly suggest to use the same name of the source file. 
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9. To Down Load and Run a Program 
Once the “.tcd” file was created, the user will be ready to run the program 
on the transputer network. Before you down load, check the wiring diagram (Utility 7), 
and see if the links are properly connected. After this, exit from the fold editor and 
eXeciiteattheay vio () pxounoi. 
TDSLOAD filename.ted 
What will happen is that the file will be opened by the VAX and the 
programmer will be prompted for the escape sequence ({ normally is ESC ESC ESC), 
After typing the escape sequence the transputer become active and the code is loaded. 
Check the Manual if any Error message occurs. After the program is down loaded, it 
will be executed at once, with no need of any other intervention of the user. To stop 
the transputer press reset at the BOO] board. 
10. To List a Program 
There are two ways to do that: 
1 The first one 1s under VMS, one shall use the OPS command: 
e opslist filename.ops filename.lst 
This will create a list file under filename.lst to be printed at the VAX on-line 
printer. Be careful here! If one forgets to put a “filename.lst” on the command, 
the source file will be transformed in a hist file with the wrong termination. And 
worse if by chance one purges the directory, all the fold structure wtie 
programmer created will be destroyed and will have to be redone, if one needs 
to compile the program again. 


2 The second one is under TDS; one shall use utility LIST. This may be applied 
to any fold inside the program and the user will be prompted for a 
“filename.lis”. We suggest the termination to be changed to .lst to differentiate 
from the OPS list files. 
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Il. Final Remarks 


There are other commands and utilities that after a while one may need to 


use, but for the very beginning, the ones listed here will suffice. Figure C.1 shows the 


utilities for the TDS System. 


Special Functions 
FUNC h: HELP 


FUNC f: FOLD INFO 


FUNC s: SETUP 
Key 


This function ts o=tained 
Dy pressing FUNML and then 
the wey 


This function 1s ootained 


by pressing the kay 





TRANSPUTER CHECK 
TRANSPUTER COMPILE 
MAKE PROGRAM 

MAKE SC PROC 
DESCRIPTOR INFO 
EXTRACT TO FILE 
WIRING DIAGRAM 
SEARCH 

REPLACE 


LIST 


Figure C.! The Utilities for the TDS System. 
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APPENDIX D 
HINTS ABOUT OCCAM PROGRAMMING 


The goal of this appendix is two fold. First to mention some different and 
interesting facts that happened to us and may happen to anyone programming for the 
first time in Occam, and second to make some comments about the Link Evaluation 
Program. 

a. Program Structure 

The program structure for OPS and TDS 1s quite similar, just differing in the 
global definitions , configuration, and some predefined procedures. The difference in 
the global definitions is a very critical one. While in the OPS we place the CHAN 
Screen AT 1 and CHAN Keyboard AT 2, in TDS we just declare CHAN Screen: and 
CHAN Kevboard:, because the Screen and Keyboard handling is done throughout the 
terminal_driver, 

The configuration section of a program is the one were we map the physical 
channels and the processes onto the processors, and it only exists for the TDS system. 

The pre-defined run time procedures are described in detail in the TDS manual 
and the OPS manual, but they only can be used with the TDS. Some examples are: 
the BYTE-:SLICE. INPUT, PUT a Te, READ DIE. WORD SLICE OCmir Gaia 

I. A program in OPS 


Figure D.1 describes the structure of an OPS program. 


PROGRAM progname 
lobal_def.ops (collection of system defined constants) 
ibrary.occ (if wanted 

=o eal Pre couunce used inside your program (optional) 


peeeRoe =o= any SSR a uate Con tas procedure that one may 
De eERee --- refer to and call from the main of the Reghai 
PROG. 3 (aucanereree: ee procediige sic at cc ey eeeme alich ie 
=== local deri wero. for the main 
== ac 
on ae it could be PAR, ALT, WHILE TRUE, IF or a replicator 
code 


Figure D.1 OPS Program stmetune 
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2. A Program in TDS 
Figure D.2 describes the structure of a TDS program showing as an 
example the structure of the LINK EVALLATION PROGRAM. 


PROGRAM link.evaluation | 
--- each one of the following procedures have the same structure as 
= —  dealctedsonweiicgure D. 1 


SC koe hostproc pea al eens) --- code for transputer root 
SC PROC transfer0.B003 (parameters) --- code for transputer0O 
SC PROC transferl.BOO3 (parameters) --- code for transputerl 
SC PROC transfer2_B003 (parameters) --- code for transputer2 
SC PROC transfer3_B003 (parameters) --- code for transputer3 


| ---eonrrguiration 
Pine celinviions 
... physical channels declaration 
PLACED PAR 
PROCESSOR ROOT --- ROOT = 100 (one may use any process number) 
...-channel placements (physical placement of the channels 
(according the network topology) 
hostproc (physical channel parameters) 
--- the process hostproc is the outermost placed on 
= EbanSpUsemmeaoOn aid has to be an SC PROC 


EG Och ook Ves Like Sheym@ for Eransputer root, in each of 
PRCGESSOR 1 --- the processors 1t is made a physical channel 
PROCESSOR 2 --- placement and a process placement. 


PeochosOR 3 === 


Figure D.2. TDS Program Structure Example. 


line two Fisures 3) and @):2 give toe the reader an idea of the general 
structure of an OPS and a TDS program. Normally, the terminal driver is one of the 
SC PROCS, inside the process placed in the transputer root, to permit user and screen 
interaction. Cordeiro {Ref. 6], describes in detail how to make a configuration and how 
tO map a program made for OPS into the TDS system, and therefore it will not be 
addressed here. Again, the best way to begin programming in Occam is to look at 
sample programs already made. 

b. Problems and Suggestions 
1. Setting up Some Standards 

Earlv in the learning process we felt necessary to standardize some of our 
procedures when programming. This may be not the best, but this is what we came out 
after several changes through the research process, and is given as a suggestion only: 


e Use all vour procedure and variable names in lower case. The system has some 
predefined variables like “EndBuffer”, and all reserved words are uppercase. So 
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doing this, one will not have problems of naming because both the OPS and 
TDS are case sensitive. For example you may use a variable named “true” and 
no problem with the system defined “TRUE” will occur. 


e When in the code one has a replicator with multiple statements under it use 
always a SKIP as shown in Figure D.3 . That will make certain that the last 
index value is executed. 


e In programs with repetitive interactions with the user, use a new.line after each 
execution and before the new prompt to the user 
(Keyboard ? var). This will prevent unwanted multiple executions. 


e Every time a comment is placedtin the codemuse atvleast 39dasnes., Pinisewie 
enable one to recognize easily in the printout, what is comment, and what is the 
beginning of a fold. 


SEO i = [0 FOR 5] 
SE 


in ? varl 
OUL Vote 
SK LP --- this is the SKIP we felt necessary 


Figures Dis me SIP Usace: 


2. When Making Any Procedure 
In order to permit any procedure to run in parallel (always), with any other 
process, use as much channels as possible as parameters, instead of VAR or VALUES. 
The channels will enable the programmer to exchange data between two procedures 
without a procedure call. This is the key for the parallelism. One good example were 
this was used is the procedure cpubusysum, in Appendix E. Other examples can be 
seen in the library routines defined inside the procedure getchoice, also in Appendix E. 
Also make the procedures, SC PROCs, as much as possible. This is better 
for the programmer because if an compilation error occurs, it will be detected earlier 
and the recompilation time will be shorter. It is also better for the compiler because it 
stavs away from the compilation limit. 
3. When Compiling 
When compiling, several errors may be flagged. If an error message: 


e ”... shared variable varname” , occurs, change the check compilation parameter 
to false. When check is true even the output of the same variable to several 
different channels in parallel, will make the compiler flag the error, when it does 
not exist really. 
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If any errors occur, the compiler will position the cursor always before the 
error exact position. Sometimes the error will be on the same line, and sometimes in 
the next line of code. 

4. When Making Large Programs 

When making large programs, one should take care of the compiler code 
limit either for OPS or TDS. In the VAX this limit is around 100 blocks, or 50 kbytes 
of code. To get around this problem, one should make some procedures inside the 
program as Separate Compilation (SC) procedures and the compiler than will be able 
to handle it. 

5. When Down Loading the Code 

When down loading the code, several times a message like the following 

one will occur: 


e *... Illegal board function” - we had that a lot with no reasonable cause. The 
action taken when this happened was to down load again, sometimes up to 4 
times to have the code down loaded properly to the transputer network. 


c. Comments About the Link Evaluation Program 

The Link Evaluation Program takes about 340 blocks of the VAX, or 
approximately 170k of code and comments. 

Our approach in doing the Link Evaluation Program was Top down and we 
think it this was the right one. First the general structure was made, with all 
procedures but the user interface and the terminal driver replaced by stubs. When this 
was running, then one by one the byte.slice.transfer, the inout.transfer, the 
word.slice.transfer and finally the int.transfer were added. Even though all these 
procedures where pre-tested using dedicated harnesses, some times new bugs came out 
as they Were put together. 

In general the structure of the program is based on the four procedures just 
mentioned, that reside one of each, in each of the transputers. When executed,the the 
user choice of construct make the respective procedure be executed in parallel in all 5 
transputers. 

I. Most Common Errors 

e Bad definition of buffer limits and lack of initialization. 


¢ mismatch of channel usage - a process outputting to a channel that no other 
process was waiting for an input. 


e¢ compilation limit achieved - this happened in procedure hostproc and in order 
not to affect the performance measurements, the SC procedure get.choice was 
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created using part of the user.interface code, and so procedure user.interface 
passed to call getchoice. 


The difficulty of finding the first two problems is due to the symptom to be 
the program freezing in execution on the screen and no message coming. To find were 
the error was occurring approximately, we placed some “Screen ! var” statements in the 
middle of the code, and from then on only reading the code and guessing what it could 
be, worked. We tried, and succeeded, also to trace the execution, by looking at the 
listed code and following the flow of communications. 

As a final comment, the facility to reuse previously created software is 
tremendous. Each configuration just need to be done once, and can be always reused 
by just changing the name of the placed procedures. The procedures and programs can 
be annexed to a new file or filed with one key stroke, the utility file/unfile of the fold 


editor. 
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APPENDIX E 
THE LINK EVALUATION PROGRAM 


-- header.occ 
mm RK KA AKA RKKKKKRKKRAKKKKRKKRRRR RK RERRRRRKRKRRRRRRERRKKRERERRREKRKERERKKE 


--- * Title : Link Evaluation Program s 
--- * Version : Ss 
--- * Mod : O < 
=== * Author : Jose Vanni Filho, Ledr., Brazilian Navy * 
--- * Date : June / 02 / 1987 K 
= Programming ERIE Nef. : OCCAM 1 = 
--- * Compiler : IMS D 600 - TDS e 
--- * Purpose : To Evaluate the Transputer link transfer rate x 
ae for several channel parallelism situations, * 
aaa GCQnstruce types and different cpu loads x 
mane RRKKKAARKAKKKAKRKAKKAKKERARRR KEKE RARER RRR RRA RRR ER ERE RERERERERERERKK 


Seeeglet description of program 

mn KKAAKARARAKKARKKRAARKKAKKARARARAKKAKKKKRKAKKRARARARARARARARAERKAKRARRKKKE 
--- Interactive program that uses the INMOS links at 10 Mbits/sec and 
--- evaluates the transfer rates from the bO0O1 board to the b003 board 
=== using one to four channels in parallel for output and input. 

ae tempEOghamn calculates and display the tmansterm nate aftersa 

a Bier ues NUMDEr Of runs (20 eGemeniewoueiea table format for, 

the following block.size and channel configurations: 


-- Block Sizes 
--- 1 - 2° 4 - 8 - 16 - 32° = 64 - 128 
--- 256 - 512 - 1024 - 1280 - 2048 - 4096 - 8192 - 10000 


Bee icine CONLigurations . 

oe ou = 1) ‘caannel(Venepuc) an one link, 

=== channels(input and ee in Datei Otles stink 
channels(output) in parallel in two links | 

channels abe and ouEEEt) in par in two links 

channels(ourput) ingparaliel in three links) 

channels aE and CUESUI metas par Inm@eatee® links 

channels(output) in parallel in four links 

channels(input) in parallel in four links | 

channels(input and output) in par in four links 


POP BW WNNrH Fr 

fe 

ro 

res 

O 

c 

ct 
Prrrrnris 
COP 6) WBN NF 


-- User options during program execution 

--- User Options: 

——— © GrUs MODES OF OPERATION 

aa QO - No concurrent process in the cpus 


--- 1 - BOO3 cpus with sum process concurrently (par) 

=== Zee cPus With sum process concuncent1y (par) 

= 3 - BOO3 cpus with sum process concurrently (pripar) 

-<-- 4 - all cpus with sum process concurrently (pripar) 

aa Seu SsCpUS Witivartayepmeduice process cGoncurntentily (par) 
Si 6 - all cpus with array product process opeaven ty par) 
aie 7 - BOO3 cpus with array product process concurrently Been 
a5 8 - all cpus with array product process concurrently (pripar 
== Celis Gnvuets AND DATA TYPES 

=> Peeereuc/ OUTPUT Channels CHARACTERS (BYTES 

=o5 EuGeoyte Slice inputjoutput ( CHARACTERS (ByiHs 

Sle I - input/output channels INTEGERS WORDS 


--- Wl - word slice input/out ut INTEGERS WORDS 
ma KKK KKAKKAERRKRRRAERERRERRERRERERRRARRAARRKRAKKARKKRKRRAKRKRKERRAKKERERAERERE 
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oi KAKAKKKAKKAKKKKKKKAKKKKKKKKKKRKAKKKKRKKKRKAKKKKKKKRAKKKRKAKAKKKRKAARKKKKKAARKKK 


-- PROGRAM link.evaluation 
a KK RRKKKERKRKEKKKRE RRR RRR EREKREKRERRERERREREKRERRERERAEKKKAKKERKAKARKKEERKRE 


-- link.evaluation PROCESSES 


-- TRANSPUTER ROOTSOOL. SDs 

== SC PROC hostproc 

-- PROC hostproc (CHAN A,B,C,D,E,F,G,H 

PROC hostproc (CHAN A, ByCjD aE .G,he 

=e cescription . 

mae KAKKKRAKAARKAKAAKAKAKKAAKKAARKAKAAKRAKKARARRKA RAK RA RRKAKAKRAKRAKAAKAKRRKARAKE 
--- This is the outer procedure placed on transputer Root. It contains 
--- global variables and constants andwall procedumes Ghar Guim ieemees 
--- transputer. It executes in parallel the procedures : 


--- terminal.driver and user.interface 
mm RK KKK RKKKKKKRE KERR KERR RRR RRR RRRERR RAR RRRRRERERERRERERRERRERRKERAKKE 


-- glopal_deft.tds (partial) 
-- Constants Definitions 
DEF EncBuffer = -3: 


DEF port = 0O:--- assign the i/o port of the BOO1 to terminal 
DEF baud = 1l1:--- set the baud.rate to 9600 bps 
=—--sCOnStanely Usedpese ine omnes 

DEF tab = 9: 

DEF lf = 10; 

DERecy = 13: 

DEF esc = 27 

DEF sp = 12. 

-- Channels Definitions 

CHAN Screen : <---> defined for output to the Screen 
CHAN Keyboard : --- defined for input from the Keyboard 


-- Link Definition 
DEF linkOout = 
DEF lanklour 
DEF link2out 
DEF link3out 
DEF linkOin 
DEF linkilin 
DEF link2in 
DEF link3in 


=a BaP ran Rees (partial) 
== 10 COuUmine sauce 


Sane OC ane we bine 
PROC new.line = 
mann RAKKKKRRAKAKAKKAAAKAARKKARKAKARRKARARA KK AA RAK AAAAAKA KAKA RK KKK AKA 


--- jumps to a new line on the screen 
mann RAKKKAKAAKKAAKARKKARRRAAKARKAAK AR AAKARRRK AKA RRAAKKRR KAKA KARA RAK 
SEQ 
Screen ! ¢Cr-elt Endburter 


-- PROC write String (Vecure s eeaciae 
PROC write string (VALUE string(]} = 
mn KKAKKRARKARARKAARRARAKRARR RAR ARE RARARKRERKRKERERAKRAAKRERAKAAKKAKKE KK 


--- Writes a given string to the screen, in a byte by byte fashion * 
mm KKK REKKEKRRREKRERRRRERRRARRRERRRRERRARRRRKRERKRRRERRRRRRERRRREREREREKRAE 


SEQ 
SEQ i = [1 FOR string| BYTE 0}] 
Screen!) strand (Sv bea) 
Screen ! EndBuffer : 


-- PROC clear.screen 
PROC clear.screen = 
me KK KR KKKKREKEREKKKRRERKKERKKERRERER RRR REE RKRRKKEKKKEREKKKAKKKKKERKKEKKK 


--- clears the screen 
mK KRRKEKKKREKREKRRRRRERKERKKRERKERERERRKEKRKRKERERERKKEKKARKKAEKEKERERERKERKKEKKKER 


e 
e 


Hou th ag 
“ID UTR WDNR © 


SEQ 
Screen ! esc;'[';'2';'J'-sEndBuffer --- clear screen sequence 
Screen ! es¢-')) pea: --- home cursor 


-- PROC write.number (VALUE number) 
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mmm KAKKKKKKKKKKAKKKKAKAKKKKKKKKKKKKKKAKKKKKRKAKKKKAKKKKKRAKKKKKERKE 


Sc ee Oe outputs a signed integer value to the screen * 
mam RARAKAAKKARKKARARAKRAKRKRARRAKAKRK AKA RRKKAK AKER AKA RRAKRAKRARKKR 


PROC write.number(VALUE number) = 
LER Cuemvcldo|, Coume, x: 
EQ 


X¥:= number 


Coune:= 0 
IF 
ah, handle special cases 
x= 
Sereen ' Or 
ei) 
SEQ 
Screen ! '‘-! 
X:=-X 
TRUE 
SKIP 
WHILE x>0 
~~ construct number 
SEO 
Sttemercount| -= (x 10) + '0' 
count := count + 1 
xX:= x/10 


Watee Count > 0 
-- output number 
SEQ 
COUN Ee =s.coune=| 
Screen ! output{[count] 


SKIP: 


== Utilities.occ 
=- FROC transfer.rate (VALUE start,stop,board.type,nr.of.bytes...) 
Peoewecanste:.rate (VALUE Start, stop, board.type, nr.of.bytes, 
VAR rate) = 
me KAKAAKAAAAKAAKAKKKAKRARKAKAKARARAARKKRKKRARKRARKRKAKKKRKKAKKAKKKKKKK 
j——emececives two tick values "start" and "stop", number of bytes _ 


--- and board type and outputs the transfer rate. 
mae RAKKKARAAKAAKKAKKKAAKK AKA KKAKK KKK KAKKKKKAKKRKAKKARAKKAARAKKAK AK 


-- board number definitions 


=~ peaiba.tyoe — 0 -==-> VAX VMS 

--- Oana, tyoe = | =----> BOO] 

ecw pe = 2 —--—> BOOZ . Sa 

--- board.type = 31----> BQ03 hagh prloracy ) 
~-~- board.type = 32----> BO0O3 ( low priority ) 
--—Soeema.cype — 40-=—-—> BOCd 


--- outputs to the screen the transfer rate in kbits per second 
-- constant definitions 


DEF vax.sec =10000000 : --- hundreds of nsec/second 

DEF bOOl.sec = 625000 : ~sneTote.ommicrosecy second 
Depsovdsh.sec = LOOQCOOM: 22s ome iemasec/ SECON 
DEpesdosl.sec = 15625 : --- # of 64 microsec/second 

DEF max.number.of.ticks = 2147483648 : --- maximum integer (2%**31) 


-- variable declarations 
VAR elapsed.tick : 
VAR factor : = ecmeonvei@umabeks TO seconds 


SEQ | 
elapsed.tick := stop - start 
ie 


elapsed.tick < 0 . 
elapsed.tick := elapsed.tick + max.number.of.ticks 
TRUE 
SKIP | 
-~- selection of correct factor in accordance with the board 
Le 
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board ey pes —m0 re Kv is 
factor := vax.sec 


board.type = 1 SoS JelO0l. 

actor: — OC meses 
board.type = 2 =o bh OO Za a | 

Sire --- to be implemented in the future 
board.type = 3l ==) 8003 in hicieron ae 

eReeGrnee bOO3h.sec a 2 
board.type = 32 ==- BOOS in low priority 

factor := bOO3l.sec 
board.type = 4 ~~ pod 

SKIP --- to be implemented in the future 


-- rate calculation 
IF 
board.type = 32 ; 
rate := ((nr.of.bytes*8)*factor)/(elapsed.tick*1000) 
ae operation is done this way to keep precision ok! 
TRUE 
rate := eee eee Eee ee tO na onan 
--- operation is done in this way in order to don't exceed 
--- maxint on the numerator. 


= ne ey 8 due to 8 bits per byte ; 
--- divide by 1000 to have the transfer rate in kbits/sec 


SKIP: 


-- PROC capitalize (VAR ch) 


PROC capitalize (VAR ch) = 
mann KAKKARAKRARKRKAKAKRRKRARARREKKAARAKAKKAKRAKRARAARAKRRARAKRARAKKRKR 


aed capitalizes any lower case character into upper case 
mann KRKKKAKARARAKKKRKAKKRARKRARKRARARKRKRAKKRA RRR ARERR ARARERKKKERKERKKKE 


DEF delta =('a!' - 'A') 


--- A ---> 65 
--- a-r-> 97 ASCII values 
--- Z ---> 122 
SEQ 
IF 
(ch <= 'z') AND (ch >= ‘'a') 
ch := ch - delta 
TRUE 
SKIP 


-- SC PROC IMS.BO01.terminal.driver() 
“= TERMINAL DRIVoke ws 
-- PROC IMS.BO0O1.terminal.driver (CHAN KeyDoa ne ec uces 
ALUE port,baud.rate) 


i KEKKKKKRKKRKKKKRKRKKRKRKRKRKRKRKRKRKKRKRKRKKKKKRKKRRKRKRRRKRRRRRRKKRRRRRKRKRKKRAKKKEK 


--- The terminal driver used is the one provided by the 
--- manufacturer for the board B00] | fenders nat oacou 


--- is not included here. 
ma RR KKKKKK KE KARKREKRERERKEKEKERERKERERKERKEK EKER KR KRKEREKEKRAKRKRARKRKAKKKKK 


ez 


~- SC PROC cpubusysum (CHAN flagl,counterchan) eS UT 

Seer JoraUM. IDS 

-- PROC cpubusvsum (CHAN flagl,counterchan) 

PROC cpubusysum (CHAN flagl,counterchan)= 

= description 

mann RAKAKKARAKARKAAKAAKARKARAAKAARAAAAK AAA AAR AAA AA AA ARR RK AAA RRR KAKA RKRKRK RK 
--- It keeps the cpu working in parallel(time sharing) with the link 
--- transfers by doing sum operations . It stops when it receives 
--- a flag by the channel flagi from the transfer procedure that is 
Soemeoelng Csecutecdueencurrentiy. It outputs by channel counterchan 


--- the number of operations done. 
mee KKAKKARAKKAKKAARKRARAAKAAAKKKARAAAKKAAKAARKKAAK AA AKA KKAKRAARKARRRR 


Uebea,D,e, 
working, 
COURECY, 
Cli: 


SEQ 
counter 0 
working TRUE 
TIME 2? a 
WHILE working 
ALY 
flagl ? ch 
working := FALSE 
TIME ? b 
SEQ 
eC := a +b 
eounter == counter + 1 
counterchan ! counter: 


== CPUBUSYSUHM.dsc descriptor 
== CPUBUSYSUM.cde code 


== SC PROC Beep us yProd (CHAN flagl,counterchan) ee eG CLUIGIE 
Pome rub US Le NUD. 1D 

-- PROC cpubusyprod (CHAN flagl,counterchan) 

PROC cpubusypred (CHAN flagl,counterchan)= 

earacseriolilon 

mane RR RKKERAKKAKKARKKAAKAKARAKARARKARAKARAAKAAAAKAAKARRARKAKAAKARKKAKKARERKKR 
J eseeneeps the Cpu working in Eee een sharing) with the link 
soo tkansfers a domnigeanGay MUuluipltecations., ITE stops when receives 
--- a flag by tne channel fiagl from the transfer procedure, that 1s 
--- being executed concurrently. It outputs by channel counterchan 


--- the number of operations done. 
mane RKARKARAAKKARAKRRAARKAKKAKRAKAKAKAKARAARAKKARKRKAAKKAKAKRKAAKKKKRKKAARE 


-- constants and variable declarations 
DEF number = 100: =-— Size Ofearnay 
ifSR a(number + 1] ---- array of integers 


b{number + 1], ---- array of integers 

ejnumber + 1}, cr-- array of integers 

Slock ===- integer -variable to get time 
working, ==== pDoolean =to stop execution 
counter, ---- integer -number of operations done 


en 


SEQ 
-- initialize buffers and variables 
SEQ i= [{ 1 FOR number ] 


BEO 
2 
Die) <= nou 
SKIP 
counter ;:= 0 
working := TRUE 
WHILE working 
ALT 
flagl ? ch 
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DEE 


DEF 
DEE 
Det 
DEE 


CHAN 
CHAN 
CHAN 
CHAN 
CHAN 
CHAN 
CHAN 
CHAN 


working := FALSE 
Tine: Clock: 
oEe 
EQ i = [1 FOR number] 
efi] := afi] * b[1 
counter := counter + number ---update ns mo emorencd cm 
Counterechan !Seeunter. 


global constant and variable declarations for a a 


sizetable = TABLE [| Ioa24- 66 eo cyeos to ee 

1024, 1280, 2048, 4096, 8192 
nr.of.sizes = 16; --- as counted from above table 
maxblock.size = 1l0QG0-" > --- lasts snemsthne sabeve taste 
reperurlone= 20. ~== LOL averaqiad Pulueoses 
maxwordblock.size = maxblock.size/4: 


hostind AT linkOin: 
hostinl AT -linkiin: 
hosting eA inie en. 
hosting Ar link3in: 
hostoutO AT LinkOout: 
hostoutl AT linklout: 
hostout2 AT LlinkZ2out: 
hostout3 AT link3out: 


10000 ]: 
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-- PROC inout.transfer (VALUE ree yes cmaceunedt ) 
PROC inout.transfer ( VALUE repetition, cpumode)= 
Gio Ser 1a t LON 


mmm ANA KAKA ARARKKKKAKAKKKKKAKKKAKARKAKKAARRRKRKKKAARRKRKKR RAR KRKKKAKKKKKKKRKER 


--- It initializes the buffers and it executes the Bepee cur € 
Booetocnansteh, Tatice when dppltedble oncom tne following: 
--- See eee or cpubusy.sum. (according to cpumode) 


--- Uses global constant maxblock.size. 
mae RAKKARARARKAKAAKKKKKAKRARKK AAR RAK KKK RAR AKKAKKKAKKARKAARKAAKAKKRARARKA 


-- variable declarations 
CHAN flag, See AS tile wc mlr OmaItGD 
counter: --- return the number of operations cpu did 


VAR bufferO [BYTE maxblock.size + 1] 
bufferl [BYTE maxblock.size + 1], 
buffer2 [BYTE maxblock.size + 1], 
burters (BYTE maxblock2size + 1): 


-- PROC iotransfer (VALUE repetition, cpumode, CHAN flag, ...) 
PROC iotransfer (VALUE repetition,cpumode,CHAN flag, counter)= 

== Description 

mn RAK KR RAAKAKKARKKARKKAKKKARKAAAKARAKAR RAR ARKRRKRARARRKARRAKRRRK 
Beeeeexe cites Sequel eae Se cuoepaudube met ansters of eyes 
Saaete, from One to four erdiceuterns eed the input/outpu 

--- primitive and output to the screen the transfer rate 

--=- values of the output TABLE. 


--- Uses global constants : sizetable, nr.of.sizes, repetition 
mm KRKKKEKRK KKK KERR AKRKRAKKEKKRAKEKKKKAKRAKKKKRRKAKAKEKKRRRAKRARERKR 
-- variable declarations 
VAR block. size, 
actual.rate, 
Bate; . . 
ana --- the number of operations cpu did 
G 
deadtime, deadtimeO, deadtimel, ---- to calculate deadtime 
timeO{4], 
timel [4]: 


SEQ 
SEO i = [0 FOR nr.of.sizes! 
SEQ 
Se eile) fee emmecorne 
block.size := sizetable[i] 
write.number (block.size) 
Screen ! tab 
-- calculation of deadtime 
TIME ? deadtimed | 
SEQ i = [1 FOR block.size] 
She 


TIME cece dime ae 
deadtime := deadtimel - deadtimeO 


-- output to one channel 

actual.rate := 0 7 

SEQ j = [1 FOR repetition] 
SE 


Os tin” 7 seh 
TIME ? timed 
SEQ k = [1 FOR block.size] 
estouco 'epurrery) [SYTE Kk} 
TIME ? timel1[0] 


timel(0] := eccr - deadtime 
transfer.rate (time Woe te 2 etek gece 
actual.rate := ((actual.rate jJ-1)) + rate)/j 


SKIP 
write.number (actual.rate) 
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Screen 1 etab 


-- output/input from one channel 
actual.rate := 0 
SEQ j3 = [1 FOR repetition] 

SE 


OStind es cline 
TINE eeeme ORG . 
SEQ k = [1 FOR block.size] 
PAR 
hostoutO ! bufferO [BYTE k] 
hostinO ? bufferl [BYTE k] 
TIME ? timel[0] 
timel[0O] := = (Ee 0 - deadtime 
transfer.rate(timeQ|0 timel[Q],1 »block.size,rate) 
ee actual.rate := eran rate a -1)) + rate)/3 
write.number ae 
Screen ! tab 


== OULDUG GO EvOectwanne ls 


actual. watewi= 
SEQ 3 = [1 FOR repetition] 
SEQ 
PAR 


hostinO ? ony 
Host aden. eee 
TIME ? timeO[0] | 
SEQ k = [1 FOR block.size] 
PAR 
hostoutO ! buffer0 aE k 
MOSeOUCI yao Uiie lemon elk 
TIME ? timel[0] 


timel(0] := See Loan = dedceame 
transfer.rate(time0|0],timel[0 O},1 1,block.size,rate) 
actual.rate := teoreuae rate * (j -1)) + + rate)/j 


Shr 
write.number (actual.rate) 
Screen ! tab 


-- output/input from two channels 
actudlsndeem.— 90 
See ef = [1 FOR repetition] 

SEQ 


PAR — 
hostino ? orto 
hostinl 2 ent 

TIME ? timeO[0] 

See ee [ 1 FOR block.size |] 
hostoutO ! buffer0O eae k 
hostoutl ! bufferl |BYTE k 
hostino ? buffer2 an Y 
hostinl ? buffer3 |BYTE k 

TIME ? timel[0] 


timel[O] := SE eoHGh - deadtime 
transfer.rate(timeO[0],timel[0 Oj} ,1, PMG nis Ze tcieey) 
actual.rate := Yeectuad rate * (j-1)) + rate)/3j 


SKIP 
write.number (actual.rate) 
Screen ! tab 


-- output to three channels 
AGE a eho Comm”, 
ey - = [1 FOR repetition] 


AR 

hostinoO ? ch[0O 

hostinl ? empl 

NOStiiZ) ee mee 
TIME ? timeod[0] | 
SEQ k = [1 FOR block.size] 
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PAR 








hostoutO ! bufferO [BYTE k 

hostoutl ! bufferl [BYTE k 

Mosteutez.! bufferZ2 [BYTE k 
TIME ? time1[0 
time /(O} -:= eee ecie - deadtime 
transfer.rate(time0[0],timel[0 O},1 te logis.size, bate) 
actual.rate <:= ((actual. rate * (j =i) + rate)/j 


SKIP 


write.number (actual.rate) 
Screen ! tab 


-- output/input from three channels 


actual.rate := 0 
SEQ j = [1 FOR repetition] 
SEQ 
PAR 


HOStINO © ven o 
NOSElmL © “clit 
hestinZ ? clip 
TIME ? timeO([0] 
See ae = [ -l ORB @llock .caze™ | 

















hostoutO ! bufferO [BYTE k 

Hhosvoutl. to Diurrer BYTE. k 

hostout2 ! buffer2 |BYTE k 

hOstine - butterO [SYTE k 

Nostiii Ui remem Ey it ik 

RO@spine se bufeera LEYTE k 
Dine. ? time! (0 
timel[0O] := s(eimeotO] - deadtime 
transfer.rate(time0[0],timel1[0 0} ,1 eMOGinns UZenhate) 
actual.rate := eee rate * (3 i) + rate)/3 


SKIP 
write.number (actual.rate) 
Screen ! tab 


= J OUEDUE EON LONE Chanke ls 


actual.rate := 0 
SEQ = [1 FOR repetition] 
E 
hesiemno -- chi0 
hostini »eeeiis 
Hestinz. -cre 
hOstinssorch > 
TIME ? timeO[0] | 
SEQ k = {1 FOR block.size] 
PAR 
hostoutO ! bufferO [BYTE k 
hostoutl ! bufferl [BYTE k 
hestoutz =! butferZ2 (BYTE k 
HOStOULs se mUererom BYTE kK 
UME) se temo ta) Oo) 
timel[O] := ET emco to) - deadtime 
transfer.rate(time0]|0 time1[Q],1,block. size,rate) 
actual.rate := eee rate * (j-1)) + rate)/} 


SKIP 
write.number (actual.rate) 
Screen ! tab 


-- input from four channels 


eG Guia crews. 0 sale 
SEQ 3 = [1 FOR repetition] 
SEQ 
PAR 


hos tine «7 sehue 
hOstin: 2 chil 
hesiaisiazene Gnies 
hostams 7 -cings 
TIME ? timeO[0] 


ly 


SEO = { 1 FOR block.size ] 


hostinO ? bufferO [BYTE k 

hostin) ?9busterlepeviE k 

HOstinZd vesutferZ |svan ak 

hostin3 7 2outren3s |) Bveeak 
TIME ? timel[0] 
timel[0O] := 5 (eine - deadtime 
transfer.rate(timeO[0],timel[{0],1,block.size 
actual.rate := ((actual. bate ~ (je) ) + rate) 


SKIP 
write.number (actual.rate) 
Screen ! tab 


-- all output and input in parallel 


ACtUaieh a ceo 
SEQ 3 = {1 FOR repetition] 
SEQ 
PAR 
Hosting 2 Fem re 
hos tiie wenn 
hostin2 ? chl2 
hostin3 ? chl3 
TIME timed [0] 
PAR 


SEQ k = [1 FOR block.size] 
PAR 


hostoutO ! bufferO [BYTE k 
hostoutl ! bufferl {BYTE k 
hostout2 ! buffer2 |BYTE k 
hostouts !s butters BYTE k 
Seon {1 FOR block.size] 


AOStInNG) 7 SuUrfenOm ie ie ik 
hOS tim 2 (put Pe tame alee 
ROStinZg 7 sDuULter Zo) cen 
hos tin3ezeout ters (BYTE =k 
TIME 7, tamel (oy 
timel[O] := Sse - deadtime 
transfer.rate(timeOd|0], tamel[Q],1 BEGC: size 
actual.rate := ((actual. rate * (j-1)) + rat tes/3 
Shee 
write.number (actual.rate) 
new. line 
Sgn 
new. line 


~~ send to screen operations done concurrently 
DE 


rate) 


/j 


rate) 
J 


cpumode = '0Q' 
write.string (" No Pere an running concurrently ") 
(((cpumode='!2' ")OR(cpunode )) 
; ((cpumode='6' )OR(cpumode='8'))) 
SEQ 
flageaia a: 


counter ? number 
write.string 
write.string 
write.number 
new.line 

hostinO ? number 


"the bOOl transputer '') 
number) 





"Number of operations (in //) at ") 


Write. string (inten 2B 7 econ ime //) at '") 
write.string ("transputer (b003 
write.number nna 





TRUE 


Sie 
Veco ? number 
write.string ("Number of o erations (alae )/ 2) en!) 
write.string ("transputer 0(b003)'') 
write.number (number 
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new.line 
new.line 


SEQ athe Meter 


transfer 


=SenbciaiezZipdeurters 
SEQ k = [1 FOR maxblock.size] 


SE 
ufferO [BYTE 
PULLerl (BYTE 
buffer2 |BYTE 
- bufrer3 {BYTE 


ti 
a~ 


try 


cpumode = '2' 
PAR 
BOL Ean SE Se 
cpubusysum 
epuntees = 4 
PRI PAR 
lotransfer 
cpoubusysum 
cpumode = '6! 
PAR 
1otransfer 


lk = 'gQ! 
le = Li 
lk = 19! 
le = !32! 


eae cpumode, flag, counter) 


flag, counter 


eee eee cpumode, flag, 


flag, counter 


(repetition, cpumode, flag, 


ubusyprod (flag, counter) 


G 
epinieee = 'S 
PRI PAR 


lotransfer (repetition, cpumode, flag, 


cpubusyprod (flag, counter) 


counter) 


eounter) 


counter ) 


iotransfer (repetition, cpumode, flag, counter): 
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-- PROC byte.slice.transfer (VALUE repetition, cpumode) 

PROC byte.slice.transfer (VALUE repetition, cpumode)= 

== desculpnem 

mn RAR ARKRARAAKKKKAKKAKARARARAARARAKAKAAKARAAAAKAAKAAAAAAAAAA AAR AKA RR 
--- It initializes the buffers and it executes the procedure 

--- transfer, and, when apr iC aes ene Grate following: 

--- cpubusy.prod or cpubusy.sum. (according to cpumode) 


--- Uses global constant maxblock.size. 
me KRAKKRKRARKARAK RRA RKAARKAARRARRRAR RA AKRARKKRAAKKAAAKKKARARKARAKKRKKAR 


-- variable declarations 
CHAN flag, =a flags Ehe Cpls bess top . 
counter: =>- rerunn Che: nutibemse. openaerona coumeana 
VAR bufferO [BYTE maxblock.size + y 
bufferl (BYth maxbloei sizes, > 
but se Z by IE ma bike cic. Zeman 
buffers eLe71E max@ loess 7 cetera 


-- PROC transfer (VALUE repetition, cpumode, CHAN flag, counter) 
PROC transfer (VALUE repetition,cpumode,CHAN flag, counter)= 

==) DesCrile Eilon 

ma RKKK RRR AKAAKKKAAKAKKAAAAKKAAAKAKAAKAAAKAKARAKKAAKKAARKARRAK KA RAK 
--- Executes sequentially several parallel transfers of bytes 

~~~ to/from 1 to four transputers usingeche by ie mbenne hocecdume 
--- and output to the screen the transfer rate values of the 

== S OUEDUE FABEEs 


~~~ Uses global constants =: s 


1zetabl 
RK KKK RERKKRRAEKAKEREKERKERERRRRE 


eet oe 
-- variable declarations 
VAR block.size, 
actual.rate, 
rate, 
number, ~-- the number of operations cpu did 
ch[4] 
timeQ/4], 
eee 


SEQ 
SEQ 1 = [0 FOR nr.of.sizes] 
SEO 


-~ making the table after each 1o operation 
block.size := sizetable[1i] 
write.number (block.size) 
screen ! tab 
-- output to one channel 
actual.rate := 0 Mex: 
SES) = [1 FOR repetition] 
> 


ostindO ? ch[0 
TIME ? timeO[0 . 
BYTE.SLICE.OUTPUT (hostout0 ,buffer0,1,block.size) 
CNS eet tense ; . 
transfer.rate Pe Oe aetaee Te raaeeteey a 
actual.rate := ((actual.rate j-1)) + rate)/j 

ine 

write.number (actual.rate) 

Screen ! tab 


-- output/input to one channel 
actual.rate := 0 ve 
SEQ 3 = [1 FOR repetition] 

SE 


ostind ? cat od 
TIMES, camed ko 
PAR 


BYTE.SLICE.OUTPUT (hostout0 ,buffer0,1,block.size) 
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Bevo oulee. ee Burter! 1, b1lock.size) 
TIME ? timel1[0 
transfer.rate ee ,timel a ,1,block.size,rate) 


actual.rate := eae .rate j- 1)) + rate)/}j 
SKIP 


write.number a rate) 
Screen ! tab 


-- output to two channels 
actual.rate := 0 


Se = [1 FOR repetition] 


PAR 
hostinoO ? ch[0 
hWOStin) seen) 
TIME ? timed[0] 
PAR 
BYTE .SLICEe PEP Ue oe cout ,oufferod, MET cee es 
Byte oblen -OUTEUL (NOS tT Oumm=plErerl,1 block.size 
TG ieed timel[0] 
transfer. rate(time0[0 timel[Q], 1,block.size,rate) 


actual.rate := Veo rate k(j- -1)) = rates/j 
See 


write.number (actual.rate) 
Screen ! tab 


a output/input from two channels 


aGhiate race == 0 
SEQ j = [1 FOR repetition] 
SEO 
EAR 


hostinod ? So 
HOStinl 2? sem. 
TIME ? timeO[0] 
PAR 
BYDe.SbICe. OUTPUT ChostoutL, FoOWereno,1,block. es 
Bede, SLiGhrOUrPUl(nOSstouel pufrferl,1,block.size 
BVle.SLuGEe. Pa eoc eee -buffer2, 1, block, Sze 
BYTE. Sbaen. INPUT ChOStinl buffers, ‘block. size 
TIME 2 timel[0] 
transfer. pate ae [O] ,timel[ [0], 1,block.size,rate) 
ae actual.rate := (actual. rate x (5- 1)) + rates/j 
write .number — 
Sereen ! tab 


-- output to three channels 


actual.rate := 0 
SEQ j = [1 FOR repetition] 
SEQ 


hocin0. ? chic 
hostinl ? ch 
nOoStinZ 2 eh 2 
TIME ? timed[0] 





PAR 


BY RaecLIGe OUTPUT (hostout0 buffer0,1,block.size 
Palas elon Ure nosteler, ourrerl,1,block.size 
By Pee ou lCe.OUTPUL(hostout2 ,bufferZ,1,biock.size 
TIME ? timel[0] 
transfer. rate(time0d[0], time1[0], 1,block.size,rate) 
actual.rate := ((actual. rate x (j- -1)) + rate)/j 
Ske 


write.number (actual.rate) 
Screen ! tab 


-- output/input from three channels 
Seti socom. — a0 


SEQ j3 = [1 FOR repetition] 
Sd 
ARO 
hostinO ? ch[0] 
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hostini@e chil 
hos ti nZesqmel 2 

TIME ? timeO[0] 

PAR 
BYTE . SLICE. OUBEUL Nos toueO) Fliers mlocr.size 
BYTE .SLIGs OUTPUT (nostourl butter! 1 plock.size 
BYTE. SHEGsOULPUR hos LOUEZ ‘buffer2, 1,block.size 
BYTE.SLICE.INPUT(hostin0O -buffero, ie cee Sige 
BYTE. SLiCe LINEUP Cnese ana ‘bufferl, i, "block. S2L2E 
BYTE .SERG2 > MIEUD(hostin2 butter | ‘block. size 

TIME ? timel[0] 

transfer. gee timel[Q], 1,block.size,rate) 

ae actual.rate := eee rate x (j- 1)) + rate)/j 

write.number — 

Screen ! tab 


== QULPUL to four channels 

actual. rate := 0 

SEQ 3 = [1 FOR repetition] 
SEQ 


PAR 
hostinoO ? 
HOS tints een 
hostin2 ? 
hostins + cn 

TIME Se time0[0] 

PAR 
BYTE. SLICE OUTPUT NostourOyeuErer”, 
BYTE.oL. Ch-OULPUL nostoumlr /pufferl1, 
BYTE .chace.OUBEUR( hostoutZ, burtfer2, 
BYTE.SLICE .OUTPUT(hostout3, ‘buffer3, LOLOCK. size 

TIME ? timel[0] 

transfer. eee cen time1[0], 1,block.size,rate) 

actual.rate := eee tl rate x (4- 1)) + rate)/3 


,lock.size 
,olock.size 
,olock.size 


ioe ee 


SKIP 
write.number ins 
Screen ! tab 


-- input from four channels 
actual rate 2= 70 


aaa 2 = [1 FOR repetition] 


AR 
hostino 
hostinl 
hostin2 
hostin3 

TIME ? timeO[0O] 

PAR 
BYTE.SLICE.INPUT(hostin0O,buffer0O,1,block. Size 
BYTE.SLICE.INPUT(hostinl, ‘bufferl, i "block. size 
BYTE.SLICE.INPUT(hostin2, ‘buf fer2, il ‘block. size 
BYTE.SLICE.INPUT(hostin3, ‘buffer3, 1 block. size 


TIME ¢ timelio 
timel[Q], 1,block.size 


VW ew) *4) ~~) 
OQ 
2 


[0] 
transfer. rate(timeO[0] einey 
= actual.rate := eee rate ko (4- -i)) 1 rate)/j 
oh 
write.number (actual.rate) 
Screen ! tab 


-- all output and input in parallel 
actual.rate := 0 


SEQ j = [1 FOR repetition] 
SEO 


PAR 
hostinO ? 
hostinl 786h 
NOSsttnZ =e 
hostin3 ? 
TIME ? timeO[0] 
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PAR 
Byles olce OUTE UL nNoswoure,ourterO 1,block.size 
Ben ob LCE CUMPUT NOstouel pouLtterl aly block.size 
Sve sel Ch.OuUme Um hostoute DurrerZ., hu Brock, size 
Brig woh ter -OunPUR noSstoullts, buffers. ‘block. size 
Bv ip shi ce. (Nell nosting ‘buffero, 1 Teck size 
Bib. Selee.INPuUL ( hostinl. ‘bufferl, La ‘block. size 
EY res olles- Ilr Ul (nostinZ. ‘buffer2, he ‘block. Salat 
Evie. Slice. INPUT( hostin3 ‘buffer3, i ‘block.size 

TIME ? timel[0] 

transfer. rate(timed[0] 


Sime (© |) bp lock.size, rate) 


Pi actual.rate := (( heoeo sae rate * (j- ie rate)/3 
write.number (actual.rate) 
new.line 
SKIP__ 
new.line 
oe send to screen operations done concurrently 
I 
cpumode = '0' 


write.string (" No ECSU Nome ruhmihig concurrently ") 
(((cpumode='2! OR(cpumode=' OR 
oe estes '6' )OR(cpumode='8'))) 


flag ! ‘a! 
counter ? number 
Meite. String Number of operations in aate tees!) 
write.string ("b0O0Ol transputer '') 
write.number (number) 
new. line 
hostinO ? number 
write.string ("Number of geresosn (Gitae/ oma tl 
Melee. string || transputer Conese) 1) 
write.number auneen) 
TRUE 


SE 
heceing ? number 


write.string ("Number of aeanon os (nies) ) at =!) 
Write-string 4 transputer Geleltle yt 
_ write.number Rabe 
few line 
new.line 
f2oen--- Main byte. slicestranstfer 
la ees buffers 
Ses = {1 FOR maxblock.size] 
ere rOmVe tie sen. s= 7 10" 
Putter le lBy Te jo see 
But Per Zee cee ieee 
buUrrers quer le. Kiera "93" 
Shar 
re 
eauumode = "2° 
PAR 
transfer (repetition, cpumode, flag, counter) 
coubusysum GElag, counter) 
cpumode = '4! 
PRI PAR 
transfer (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter) 
cpumode = '6' 
PAR 
transfer (repetition, cpumode, flag, counter) 
pees Pred (flag, counter) 
cpumo 
Pai Pak 


transfer (repetition, cpumode, flag, counter) 
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cpubusyprod (flag, counter) 
TRUE 


transfer (repetition, cpumode, ELAG,  cOunten yr 
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-- PROC int.transfer (VALUE repetition,cpumode) 

PROC int.transfer ( VALUE repetition, cpumode)= 

-=- cescription 

men KRARAKKRARARKAKKAKRAKAKARARAKARKKRERKKARERKKERKRKEKAKKKKKE RAR KKKKKKKRKE 
--- It initializes the buffers and it executes the procedure 

--- intransfer, and, when applicable one of the EanIeTnInG 

--- cpubusy.prod or cpubusy.sum. (according to cpumode) 


--- Uses global constant maxblock.size. 
mae KRRKKRARAERKARRARRRERAAK KARR RK RRR RARRAKRAKKRERKRRERRERKRRKKRERKKE 


-- variable declarations 
CHAN flag, === flags the cpWero stop 
counter: --~- return the number of operations cpu did 


VAR wbufferO [maxwordblock.size + 1] 
woufferl [maxwordblock.size + 1], 
wouffer2 [maxwordblock.size + 1], 
wboufier3 |maxwordblock.size + 1]|: 


-- PROC intransfer (VALUE repetition, cpumode, CHAN flag, counter) 
PROC intransfer (VALUE repetition,cpumode,CHAN flag, counter)= 
wore sSchioc1on 


AR gre ee eh, 


— RARARARRARKAKRARRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRI 


=== Executes Sen ea several parallel transfers of pee ers 
--~ to/from one to four transputers using input/output primitives 
--- and output to the screen the transfer rate values of the 

--- output TABLE. 


--- Uses global constants : sizetable f.sizes epetition 


tr. 0 be 
=e KR RKKRRKAKRKKRKKRAKKRKAKAKR RRR RRRERRARRRARRRREKKKRKR KR RRRERKE 


-- variable declarations 
Wane block.size, 
actual.rate, 
baie; 
ane --- the number of operations cpu did 
Cc f 
deadtime, deadtimeO, deadtimel, ---- to calculate deadtime 
ee 
timel|4 


SEQ | 
SEQ i = [0 FOR nr.of.sizes] 
SEQ | 
=" Malang the table 
block.size := sizetable[1i] 
write.number (block.size) 
Screen ! tab 
cy 
block.size < 4 — 
eeppeetscn ies ins transfer for integers '') 
write.string("is 4 bytes(word)"') 
TRUE 
SEQ . : 
-- calculation of deadtime 
TIME ? deadtimeOd 
SEQ i = {1 FOR (block.size/4) ] 


TIME ? deadtimel — 
deadtime := deadtimel - deadtimed 


-- io handling 

-- output to one channel 

actual.rate := 0 ae 

SEQ 3 = [1 FOR repetition] 
SE 


OS imOes Chilo 
TIME ? timeO[0 . 
SEQ k = [1 FOR (block.size/4) ] 
ostoutO ! wbhufferO[k] 
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TIME ? timel[0] 
timel[O] := timel[O] - deadtime 
transfer.rate (time0- Eine is0—) ee 
eee Size, rate 
= actual.rate := (Geta. rate * (j-1)) + rate)/j 
S 


write.number (actual.rate) 
Screen ! tab 


== OULDUt/ INDUL thom One cChanne! 
ac tual. Pace soe 
oem a = [1 FOR repetition] 


OStinod: 2 ene 
TIME ? timeO{0 
ae [1 FOR (block.size/4) ] 


hostoutO ! wbhufferO [k] 
hostinO ? wbhufferl 
TIME ? iene 


timel[O] := Soe = (Gleveohe aly 

transfer. peee eee 0], time (0],1, block .sizegeaees 

actual.rate := ((actual.rate * (j-1)) + rate)/}j 
Sher 


write.number (actual.rate) 
Screen ! tab 


-- output to two channels 
ACeUc nec ccm = a0 
SEO. ef = [1 FOR repetition] 


AR 
hostinO ? Pte 
hestinl ? yen 

TIME ? timeO[0] | 

SEQ k= [1 FOR (block.size/4) ] 
A 


hostoutO ! wbhuffer0d Hey 
NOs COME Sie otie hem lamas 
TIME ? timel|/0O 


timel[O] := SG - deadtime 

transfer. rate(time0d[0 timel [0] /] 1,block.size face) 

actual.rate := eee Bae aa -1)) + rate)/j 
SiGe 


write.number (actual.rate) 
Screen ! tab 


== OULpUt/inpUut from Evemenanmers 


actual.rate := 0 
SEQ 3 = [1 FOR repetition] 
SEQ 
PAR 


hostinoO ? Sat 
hostinieeeechi = 
TIME ? timeO[0] | 
SE [ 1 FOR (block.size/4) | 
P 
hostoutO ! wbhufferO [k 
hostoutl ! wbhufferl |[k 
hostinO ? wbhuffer2 
hostinl 2? weuzfers ik 


TIME ? timel(0] 

timel[0] := S(MaeOT G] SSdeadtamne 
transfer.rate(time0[0 ee ea size rate) 
actual.rate := eee rate (j-1)) + rat te$/j 


Sie 
write.number (actual.rate) 
Screen ! tab 


-- output to three channels 
actual.rate := 0 
SEQ 3 = [1 FOR repetition] 
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W? 


EQ 
P 


hostmn). 2 chit 
hostinZ2 ? ch 
PINE 7 etameo (0 
a OULBU handil ng 
Se es [1 FOR +n ock.size/4) ] 


hostoutO ! wbufferdO He 


AR 
hostino ? an 
Z 


hostoutl ! wbufferl [k 
hostout2 ! wbhuffer2 [k 


TIME ? timel(0] 
rane Loe: = SS Rec: - deadtime . 
transfer. rate(time0[0 timel[0 Oo} ,1 1,block.size,rate) 
errs actual.rate := Ce rate * (3 -1)) - rate)/j 
write.number (actual.rate) 
Screen ! tab 
-- outout/input from three channels 
actual.rate := 0 
SEQ j = [1 FOR repetition] 
SEQ 
PAR . 
hos taney sch.0 
ghey coboul  & cele\|[ 
HestinZ «- cae 
Mie? tamed! 0) 
== ete handling 
SEpas OR (block.size/4) ] 
hostoutO ! wbhufferO [k 
Hoseouelelewoutferl | k 
hostoutzZ ! wbouffer2 {k 
hostinO ? wbhuffer0O [k 
hostanl 2ewoufferl [k 
hostin2 ? wbuffer2 |{k 
TIME ? timel[0] 
timel[0O] := e(timeo {0} - deadtime 
transfer.rate(time0|{0 timel [0] ,1 block.size,rate) 
ae actual.rate := eee bate *~ (3-1)) + rate)/3 
She 
write.number (actual.rate) 
Screen ! tab 
-- output to four channels 
AC culate m= 0 
SEO = [1 FOR repetition] 
EQ 
PAR 
hosting saci 
hostinl, 2 Cail 
HostanZ2 ? “chi2 
Hhestans 7 en3 
ieee timed[0] . 
-- input Leet cutput ene 
SEQ = [1 FOR (block.size/4) ] 
hostoutO ! wbufferO [k 
hostoutl ! wbufferl [k 
hostome)Z2 ! woufferZ ik 
hostout3 ! wbhuffer3 [k 
TIME ? timel([0] 
timel[O] := e(tined| - deadtime 
transfer.rate(time0[0] ,timel[ [0] ,1,block. size,rate) 
actual.rate := ((actual. mace = (j=1)) + rate)/3 


SKIP 
write.number (actual.rate) 


Le 


Screen ! tab 


-- input from ty channels 

actudi eae me 

SEW ame | LaBCr See ion 
SEQ 


PAR 
hostind ? ch/0 
Hestinie sea 1 
hosting <een|Z 
hostins  - ens 


TIME 2 Seimeo | C] 

ae ee aan lian 

oe [ 2 BOR heen size/4) ] 
7 


hostinO ? wbufferO [k 
hostinl ? whufferl [k 
hostin2 ? wbuffer2 [k 
hostin3 ? wbuffer3 [k 


TIME ? time1[0] 

timel[O] := SE oe - deadtime 

transfer. rate(timed/[0]/,timel [0] ,1,block.sizemmates 
afte actual.rate := eae. rate * (j- 1)) + ratejya 
write.number (actual.rate) 
Screen ! tab 


-- all output and input in parallel 


ACUUdHe Edrem.— 80 
SEQ 3 = [1 FOR repetition] 
SEO 
PAR 
hostinO ? ch[0 
NOS tT nlmmesseta| 1) 
hostinz en Z 
hostin3 ? ch{3 


TIMES? timed[0] 
-- input and aut te Aen 
SER Vs = [1 FOR (block.size/4) 


aeons ! wbufferOd [k 
hostoutl ! wbufferl [k 
hostout2 ! wbuffer2 [k 
efeyeieeibies) | Wwolbidisicies [ils 
hostinO ? wbufferO [k 
hostinl ? wbufferl |[k 
hostin2 92 Wwoustel2aaie 
hostin3 ? wbuffer3 |k 


TIME ? time1[0] 
timel[O] := SR ONen - deadtime 
transfer.rate(time0/0],timel[0 Oj},1 1,block.size, rate) 
actual.rate := Ga teae rate * (j =1)) + rate)/3 
She 
write.number (actual.rate) 
new.line 
SKIP | 
new.line 


-- send to screen operations done concurrently 
DE 


cpumode = '0Q' 
write.string (" No Sea ce running concurrently ") 


(((cpumode='2' OR(cpumode=' 
((cpumode='6')OR(cpumode='8'))) 
SEQ 
flag. 'thvad 
counter ? number 
Write.string ("Number of operations (ine //) at the) 
write.string ("bOO1 transputer "') 
write.number (number) 
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new. line 
hostinO ? number 


write.string ("Number of oe aae Coie oat 9!) 
write.string ("transputer (DOCS a) 
write.number AuRber) 





TRUE 
Se 


oe tinO ? number 

write.string ("Number of SE Soa (aiey 7 ) at. |) 
write.string ('"transputer (b003 

_ write.number number) 

new.line 

new.line 


So inane trains fer 
Be OUt Perse Pott la lizceon : 
a = [1 FOR maxwordblock.size] 
woufferO [k 10000 
woutterl |k 20000 
wouffer2 ;k 30000 
tk] := 40000 


wouffer3 
SKIP 


ee 
Cpumode = '2' 
PAR 
intransfer eee cpumode, flag, counter) 
pus y Sun flag, counter 
coumode = '4' 


PRI PAR 
intransfer hae eee umode, flag, counter) 
pelle sun flag, Betce 

cpumode = 

PAR 
intransfer (repetition, cpumode, flag, counter) 
SO (f ag, counter) 

cpumode = ‘3! 

Perak 
intransfer (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter 

TRUE 
intransfer (repetition, cpumode, flag, counter): 


ey 


-- PROC word.slice.transfer (VALUE repetition, cpumode) 

PROC word.slice.transfer (VALUE repetition, cpumode)= 

ao description . 

me KKAKAKKAARARKAARKARKAAR KAA AAAKAAKA AAA AARARAKARKARAAAKAAKKAKAKAAKKAR 
--- It initializes the buffers and it executes the procedure 

--- wordtransfer, and, when deelredstescne on eeiemee lowing 

--- cpubusy.prod or cpubusy.sum. (according to cpumode) 


--- Uses globa constant maxblock.size. 
KKK KKK AKA AKKKRRARAK KEKE RRR RRR RRKEARERRRKRRRRAKERAAKKK KAKA AAKK 


-- variable declarations 
CHAN flag, -<- flags ethe cotmecmates . 
counter: --- return the number of operations cpu did 


VAR wbufferO [maxwordblock.size + u 
woufferl [maxwordblock.size + ; 
wouffer2 [maxwordblock.size + 1], 
wouffer3 [maxwordblock.size + 1]: 


-- PROC wordtransfer (VALUE repetiriommmcpunedopmecc les lagers 
BESS wordtransfer (VALUE repetition,cpumode,CHAN flag, counter)= 

= Cesc r elon 

nnn ESE KR AERA AKARRRRARK RAR AK KARE AERA AAPOR AAR AR 
=== eSeceu was Sede a several parative thdhohcmomOreerMECGcigs 
--- to/from one to four transputers using the WORD.SLICE Procedure 
--- and output to the screen the transfer rate values of the 

=== OUUDUGe la wlege 


--- Uses lobal constants 


> size etition 
KRAKKKKKAKKREKKEKEKRRERKKERE 


tab LAOL a 
Kk KRRAKKEKRRKKK 


le, n ~Suve sear 
KKK MAKKKKKRAKKREKEAR 


== vor lable dectabaenons 


VAR block.size, 7] MuUnbe h wots) TesmrC Demsrmcnsm heed 
Actua Boece SoG che Cusco clic mettle 
rate, --- auxiliary variable to hold temporary rate 
number, --- the number of operations cpu did 
ch{[4 
timeO{/4j, 
timel|4}: 


55 
SE0 i = [0 FOR nr.of.sizes] 
SEO 


-- making the table after each 10 operation 
block.size := sizetable[i] 

write.number (block.size) 

Screen ! tab 


FE 
block.size < 4 —_ . 
Meee eae aaa aa ae EbconiGh sie Gamnee Go lisie) 
ret eae "is 4 bytes(word)") 


ma RAKKARARKAKARARARKARKKKRARKAAKRAKAARAR AAR AA KK RRAAKK RR AAKAKAAAAAA AAR 
--- ATTENTION! The code) 1s shveted Nettle apacecmeumc mic he sch, 
due to printing requirements, 
man RAAKAAKAKARARKAKAKRARARKAR RRR RAK RAR RRA AK KARR AAAA AK AAA AAA RK AAK RRA A 
SEO 

-- output to one channel 

ac tualeyates a0 = 

SEQ 3 = [1 FOR repetition] 

SE 


ostinO ? ch[0 
TIME ? timeO[O | 
WORD.SLICE.OUTPUT(hostout0 ,wbuffer0,1, (block.size/4) ) 
TIME “22timenee) 


130 


transfer.rate ce O},timel Sera: size,rate) 
actual.rate := ects -rate i-1)) + rate)/j 

Sie 

write.number a. oes 

Screen ! tab 


SOME Dury npr am one. Link 
actual.rate := 0 


Se =O 3 = {1 FOR repetition] 


post inose ch © 
ae ? timed{0 
WORD.SLICE.OUTPUT (hostout0,wbhurt eee mop lock. Se 
WORD. SLICE.INPUT(hostin0O,wbufferl, t (block. size/4)) 
Tine. 2 time1l[0] 
transfer. rate(time0[0] timel [9], 1,block.size,rate) 
Eee actual.rate := ((actual.rate x (3- -1)) + rate)/j 


write.number (actual.rate) 
Screen ! tab 


-- output to two channels 


actual.rate := 0 
SEQ 3 = [1 FOR repetition] 
SEQ 
Pak 


hosting: 2 ee 
hoOscial 7. clit 
TIME ? timed[0] 
PAR 
WORD SLICE OUTPUT hostouts wbuseeeL 1, eee Sea 
WORD.SLICE .OUTPUT (hostoutl ,wbufferl,1,(block.size/4 
TIME = time, 0] 
transfer. rate(time0(0],timel[0],1,block.size,rate) 
es actual.rate := ((actual. rate * (j- 1)) + rate)/j 
write.number (actual.rate) 
Screen ! tab 


-- output/input in two links 


actual.rate := 0 
SEO j = {1 FOR repetition] 
SEQ 
PAR 


hostin’d 2uch| oO 
MoSstin. ° Chis 
TIME ? timeo[0] 
PAR 
WORD -SLIce OUTPUT (hostoutl wouFfert 1, iaicee Bea. 
WORD.SLICE.OUTPUT (hostoutl ,wobufferl,1, (block.size/4 
WORD SLICE. INPUT (hoctinl wbuffers 1 ee »size/4 
WORD.SLICE.INPUT(hostinl ,wbuffer3 Dlockss1ze/4 
TIME ? timel[0] 
EraAnSten Gate Ceimeo | Ole time1 [0], 1,block.size,rate) 
actual.rate := ((actual. rate x (5- 1)) + rate)/j 
Shier 
write.number (actual.rate) 
Screen ! tab 


-- output to three channels 


actual.rate ;:= 0 
SEQ j = {1 FOR repetition] 
SEQ 
PAR 


lostini ? ‘ena 

hostin2 ? ch 
TIME ? timeOd[0] 
PAR 


He Seo UTE TT CRE ET Genero ns‘ pe tece: See 
WORD.SLICE.OUTPUT(hostoutl ,wbufferl,1, (block.size/4 


hostino ? cht 
2 
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WORD.SLICE.OUTPUT (hostout2,wbuffer2,1,(block.size/4) ) 


TIME ? timel[0 
transfer. BE eee timel[0], 1,block.size,rate) 


actual.rate := ioe bene rate z (5- 1))> + rate)/j 
SKIP 


write.number es 
Screen ! tab 


== QUtpUL/inpUuL In vehnee ines 


actual.rate := 0 
SEQ 7 = [1 FOR repetition] 
SEO 
PAR 


hostinl  aewa 

hOstinZ 2 ech 
TIME ? timeO[0] 
PAR 


hostino ? cht 


WORD.SLICE .OUTPUT(hostoutl ,wbufferl, 
WORD.SLICE.OUTPUT(hostout2 LI 
WOR eb t CH CUM eae /wbuffero, t {bieek, size/4 


WORD. SLICE OUTPUT \hostoutl »wouffer0d, ay ‘block. size/4 
ie 


WORD.SLICE.INPUT(hostinl ,wbufferl, block.size/4 

WORD. SLICE. INPUT(hostin2, wouffer2,1,(block.size/4 
TIME ? timel[0] 
transfer. rate(time0[0], timel[90], 1,block.size,rate) 
actual.rate := ((actual.rate z (5- ee rate)/j 
SKIP 
write.number (actual.rate) 
Screen ! tab 


-- output to four channels 
actual.rate := 0 
SEe = [1 FOR repetition] 


PAR 
nestand ; 
hostinl : 
hoOstinZ = 
hostin3 

TIM Ees timed[0] 

PAR 


JN 
QO 
a 
NO 


block.size/4 
block.size/4 


} 


Sie 


WORD.SLICE.OUTPUT(hostout0O,wbuffer0,1, (block.size/4 
WORD.SLICE.OUTPUT (hostoutl,wbufferl,1, (block.size/4 
WORD. SLICE. OUIP UM hos tourz “wbuffer2, eek. s laze) 4 
WORD.SLICE .OUTPUT (hostout3 ,wbuffer3 ae block.size/4 
TIME ? timel[0] 
transfer. rate (timed0 ||P cimel [0p hee cekeotzemnace ) 
actual.rate := Wega rate * (j- 1)) + rate)/3 


write.number (actual.rate) 
Screen ! tab 


-- input from four channels 
actual. (areas. 


oe = = [1 FOR repetition] 


AR 
hostinO ? ch[0 
hostinl 2 ch 
hostinZee chilz 
AOStIns) oe eh 


TIVEee timed[0] 
AR 


WORD. SLICE.INPUT(hostin0O ,wbuffer0,1, (block.size/4 
WORD.SLICE.INPUT(hostinl ,wbufferl,1, (block.size/4 
WORD.SLICE.INPUT(hostin2,wbuffer2,1,(block.size/4 
WORD.SLICE.INPUT(hostin3 ,wbuffer3, 1, block.size/4 
TIME ? timel[0] 
transfer. ee coe timel [0], 1 block ssme rate ) 
actual.rate := eetuad. rate x (- Oe rate)/j 


2 


SKIP 
write.number (actual.rate) 
Screen ! tab 


Zoot crc OuL anaeinput in parallel 
actual.rate := 0 


SEQ j = {1 FOR repetition] 
SEO 


PAR 
hostin0d 
hostinl emt 
Nose ti ee 
NOSstans 

oe a a Hs 

are 
WORD. oLtGe. OUTPUT (noSstout® wourrer0, 2 (block.size/4 
ono. SLICE OUTPUT (hostoutl woutfer!l 1 /(block.size/4 
Too OLICh OULU CnOostouGZ woutferZ 1. (block. Stee 
(ORD. SLICE. CUBE UM hestouts woutfer31,(block.size/4 
WORD.SLICE.INPUT(hostinO, wbuffero, if block. aia 
WORD.SLICE.INPUT hostinl ,wbufferl,1, bileck.size/ 4 
WORD.SLICE.INPUT(hostin2 ,wobuffer2,1,(block.size/4 
WORD. SLICE. INPUE(ChOSEInS. \wouffer3, 1,(block.size/4 

TIME ? timel[0] 

Emans rer. rate(time0[0], timel[9], iMislocx.size,rate) 

aie actual.rate := ((actual. rate x (5- 1)) + rate)/3 


--- ATTENTION ! End of code shifted 12 soaces to tne vlett. 
ee RRR RARER RRR REAR RR RRR AARAAARAREKRRRKRARRARE 


ere 


“Weve 


oN] 


write.number (actual.rate) 
new. line 


SKIP | 
new. line 
-- send to screen operations done concurrently 
ie 
cpumode = '0' 
write. ean No ean runming <concurrently  ") 


(((cpumode='2! OR(cpumode= 
fee en ere arch ont(ceurcdesse0)) 
Se 
a6 ! ta! 
counter ? number 
write.string ("Number of operations lines boat the !) 
Marecmsering «(suet ransputer |!) 
write.number (number) 
new.line 
hostinO ? number 





write.string ("Number of Sean a Cine 7 pea!) 
wWElteostring ('transpulber (b003 
write.number Rea 





TRUE 


ha 

ostinO ? number 
write.string ("Number of Seeeeaaa (ama/ J cat") 
White. string ("transputer (b003 
write.number (number 

News) ine 

new. line 





SEQ --- main word.slice.transfer 
-- Bere buffers initialization 
SEQ k fl FOR maxwordblock.size] 
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SEQ 


VDOULEerO mils = 10000 
woufferl |[k = 20000 
wouffer2 {| = 30000 
Wide: ens mins = 40000 
SKIP 
IF 
cpumode = '2' 
PAR 


wordtransfer (repetition, cpumode, flag, counter) 
See (flag, counter) 
cpumode = '4' 
PRI PAR _ 
wordtransfer (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
cpumode = '6' 


wordtransfer eee cpumode, flag, counter) 
c Be See (flag, counter 
cpumode = '3'! 
PRI PAR _ 
wordtransfer eR eee oy cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE _ 
wordtransfer (repetition, cpumode, flag, counter): 


peo em Foe’ geeschiolces (CHAN Keyboard, Screen, WVAR@ch, ..., run) 

== PROC get.choices (CHAN Keyboard, Screen, VAR ch, cpumode, run) 
PROC get.choices (CHAN Keyboard, Screen, VAR ch cpumode run )= 

mam RA RAKAARAKKKAAKRKA KARA AKKKAKKAKKKARRRARER ERE RAKEKKARAKRAKKAREKRRRKKRRE 


--- presents menus and gets user's choices of cpumode and construct 
mame RA AAKKAKARKARKAAKAKRARKAAKKAA RK RAR AAKKRAKKAKAKRAKRRAKRARRRARRRAARER 


-- modlibrary.occ 
-- lo_routines.occ (partial) 


Poe seereOG ew oilne (CHAN Screen) 
aa Ek Oe mew. Line (CHAN Sereen) 
PROC new.line (CHAN Screen)= 
SRAAKKA KRRAKKA KAA RKAKKKKKKKKKKKKKKAKKKKKAAKKKKKKKKKKAKKKAKKRA 


cores KRARRKKRKRKK 
= JUnossa tine on the screen. May be compiled separately 
mann RXKERAKAAAARARRARARARKRAR RK RKKARKAR AR ARRARARAKREK RARER ARKRARKKK 


DEF EndBuffer = -3: 


Ditimenhe=a1 5 
DEF 1f = 10 
SEQ 
Bereen cr: 1f-Endsuffer 
== descriptor 
-- code 


== SC PROC write (CHAN Screen, VALUE string[]) 

== PROC write (CHAN Screen, VALUE strin i 

PROC write (CHAN Screen, VALUE string! |} = 

Mn KRKRKRARAKRAAKAKARKRAKKAKKRKKKARE KRKKRKKKKKKKKKAKAKKKKKAKKKKK 
--- Writes a given string to the screen, in a byte by byte fashion 


== ages eC OMpl ed separately 
man RARRAAKAAARKAARKK AKA AKRAAKARKAKRAAKRAKKRAA RRA AKRAKKRERARAAKARREKK 


DEF EndBuffer = <3: 
SE 


Q | 

S20 l=) |t FOR string/[BYTE 0} ] 
Screens rEing|( SY ie 2] 

Screen ! EndBuffer : 


-- descriptor 
-=- code 


-- SC PROC clear (CHAN Screen) 

-- PROC clear (CHAN Screen) 

PROC clear (CHAN Screen)= 

mn RARKAKAARAKKAKKKRKAKAKARAKKKKKAKKKKAKRKAKKKAAKKKRKAKKKKKKKKKEKRKKKKKKR 


--- clears the screen. May be compiled separately 
mn REKAAAKAKAKARARKAKKKKAKRARERARAR AK AARKRKRRAKAARAKKARAKKKRAARAKKKR 


DEF EndBuffer = -3: 
DEF esc = 27: 


a ; 

ereen 1 esc: '='; '2%> "J"; EndBuffer --- clear sequence 
Seréemes! ese. '[". 'H' _: --- home cursor 

SGeseh aeitGl 

-- code 


-- SC PROC write.number (CHAN Screen, VALUE number) 

-- PROC write.number (CHAN Screen, VALUE number) 

PROC write.number(CHAN Screen, VALUE number) = 

a RAAKAAKARKAAAAKKAAKKAKKRARAKAKRRRAKAKRR RR ARAA KKK RR AAKKKKRER 
--- This PROC outputs a oe integer value to the screen | 


=== May be compiled separately 
mn RAARKRAKRKAAKKARKARK AAR ARR AAKRKARRAKKKRAAKRKAKRAKRRARRARRAKK 


VARPOULDUL LG), count, x: 
SEQ 

X:= number 

count:= 0 


oe 
-- handle special cases 
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WHILE x>0 
-- construct number 
SEQ 
output{[count] := (x 10) + '0! 
count := count + l 
xX:= x/10 
WHILE coune > 0 
-- output number 


SEQ 
count := count-l 
Screen ! output[count] 
SKIES 
=— descr iston 
-- code 


=-- utilities cca sm panera) 

== SC PROC Capitabegom shea 

-- PROC capitalize es ch) 

PROC capitalize (VAR h) = 

see LRER RAR RAE AERA RRR AE RR A ee 


--- capitalizes any lower case character into Upper 
a RRAKRA RRR ARKH AAA ARTE REE He eo 


DEF delta =('a' - 'A') 


\-- A ---> 65 
--- a ---> 97 ASCII values 
--- Zz ---> 122 
SEQ 
F 
(ch <= 'z!) AND (ch >= 'a') 
Clr ch - delta 
TRUE 
SKIP 
-- descriptor 
-- code 


-- global_def.tds tiple! 
-- Constants Definitions 


DEF EndBuffer = -3: 
DEF tab = 9: 
DEF lf = 10: 
DES@ecr = 13: 
DEF esc = 27: 
DEF sp = 32: 


-- PROC write. header 
PROC write. header 
ee dE AISA GC eK He NCR SOR ee ee 


--- ene the header of the output tabl 
mam KK KKKKKKKKKKKKEKERKKRKEKERKRKE Eh RR 


SEQ 
Yun == Trun ae 
SEE eeeere 
write(Screen, "RUN # '') 
write.number a Bun) 
SClLecnar ee sp 
ee output the cpu mode to the screen 
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cpumode = '0' 


write(Screen, "cpu mode = 0 (no par proc ) Ae 
Ccpumode = '1' 

write(Screen, "cpu mode = 1 (one sum par ) Le 
cpumode = !'2' 

write(Screen, "cpu mode = 2 (all sum par ) uy 
cpumode = '3' 

write(Screen, 'cpu mode = 3 (one sum pripar) '') 
cpumode = '4'! 

write(Screen, "cpu mode = 4 (all sum pripar) ") 
cpumode = '5' 

write(Screen, "cpu mode = 5 (one prod par) ns) 
cpumode = '6' 

write(Screen, "cpu mode = 6 (all prod par) My} 
cpumode = '7'! 

write(Screen, "cpu mode = 7 (one prod pripar)") 
cpumode = '8' 
Be eCMch cen, tapimmoge — 6 (all prod pripag)" ) 

Sie 


Bence |! (Se; sp 
= output the construct type to the screen 


~ ch = 'A! 
wae Scr eer "input/output channels (bytes) ") 
Cc — 

Bates (Scr sent EBV ih SLECh-input/output (bytes) "') 


G 
Ee "input/output channels (integers)'') 
ch = ‘W 
meer een: "WORD.SLICE.input/output(integers)'') 
SKIP 
new.line bench 
new lane, (sereen 


write Seneen, “BYTEs ) > OUT Pil Ounce OUL ee 2IN/OUT 3 OUT") 
write(Screen, "3IN/OUT 4 OUT 4 IN £44IN/OUT'') 
new.line (Screen): 


VAR answer 


SEO 
Jug 
Runge 50 
SEQ | 

-- output to the screen presentation of program 
clear (Screen) 
write(Screen, '! This is an Evaluation Program for ") 
write(Screen, "the Transputer") 


new.line (Screen 

Pi eeweeereen 0 ft 15 fully interactive andyou will "') 
write(Screen, "be prompted in") 

mewerLine (Screen) 

write(Screen, '' each run to choose cpu mode and type '') 
Wolke  SCLeen, “Of CONS trices.) 

new.line (Screen 

write(Screen, " The output table will present transfer '') 
Weieetscreen, “ wates an) 

NeW iiem oc bccn) am 

WE Pe eneen Uetomes/scamrolacne 16 different ")} 
write(Screen, "block.sizes and the 9'') 

new.line (Screen) 

write(Screen, " channel configurations ") 

Ne wolne ee 

NeWerime (Sekeen . 

DE eoecrech me werk (Y)ES If you want to use it "") 
Nev eine 4 Sereen) . ; 
PaaeeKoGlee ays.) CpiGeeete Vou Want co aqure ) 
new.line (Screen) 


answer := 'z'! 


le 


-- validate answer 
WHILE ((answer <> 'Y') AND (answer <> 'N')) 
Sk 
Eee CoCo Type your cherces) 
Keyboard ? answer 
capitalize (answer) 
Screen i answer 
new.line (Screen) 


Sc iP 
clearvoemcem) 
=~ choosing eyoenos concemice 
-—= PrOmpUulsen tyocuOr Conoumic. 


vrite(Screen, " Choose type Of cons cqucimromecmuscce 
new. line (Screen 
write(Screen, " A for input/output channels (bytes) ') 


New. ee sicz.eem ) ; 
write(Screenje" “BS for BYTE .SLICE inpue/ocucputmeny tes) 
new. line(Screen) 


Write(Sereen, " I for input/output ychannels Nemeladss 
new. line(Screen) 
write(Screen, " W for WORD.SLICE input/output (words)'') 


new.line(Screen) 
-- validate type of construct 


RIL (((ch <> "A')AND(ch <> "Bi ) )ANDIC(Ghe<o Ww AND ch <> 2am 


write(Screen, " Type your choice "') 
Keyboard ? ch 

capitalize (ch) 

Screen ! ch 

new. line(Screen) 


NeWe lime oGLecuy . 

-- eeee ug cpumode during transfers 

== Prompt Ger scousinode 

write(Screen, ' Choose cpu mode during transfers") 

new eine semean 

write(Screen, "Q -> cpus executing no concurrent processes ") 

new. line(Screen ) 

write(Screen, "1 -> B003 cpus executing sum concurrently (par)") 
new. line Cscreen 

write(Screen, "2 -> all cpus executing sum concurrently (par)") 
Newer. blew aciscen) 
write(Screen, "3 => BOO3 cpus executing sum concurrently(pripar)") 
new. line (Screen) 
write(Screen, "4 -> all cpus executing sum concurrently (pripar)") 
new.line(Screen) 

write(Screen, "5 -> BO03 cpus executing array products (par)'") 
Mew Ines eigecm 

write(Screen, "6 -> all cpus executing array products (par)') 

De oe nee 

write(Screen, " 
Degen euscisee me 
write(Screen, "8 -> all cpus executing array products (pripar)") 
new. line (Screen) 


-- validate cpu mode 

cpumode := l 

WHILE ((cpumode > #38) OR (cpumode < #30 ) 
: O < cpumode < 8 (IN ASCII) 
SEQ 


write(Screen, " Type your choice "') 
Keyboard ? cpumode 

Screen ! cpumode 

new. line (Screen) 


write.header: 
-- descriptor 


-> BOO3 cpus executing array products (pripar)") 
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aaCOUe 
peer OC USer. interface 


PROC user.interface = 
ma RAKAKRAKAAKAAARAKAKARAKKAR KAKA RK AA RKAKKAKAKKKAAAAKKAKAKAKAKKARKAKKK 
pemes e Sctics Menus jamcecalis right modules to be executed 
mdm ehemtranseuber root. 
mn RRA RAKKARAAKRAKKAARARKAARKKA RAK AAKAKK RA KAKA KAAAKAKAKAKAKAKARKRKKK 
-- constant and variable declarations 
VAR run : --- number of runs made this time (RUN #) 
VAR answer ‘eee USers Cholce 1m Continue or quit 
NWemeeONstruce =: =--- users choice of construct 
VAR cpumode : --- users choice of cpu mode while transfering 
et Git el 
SEQ 
run := 0 
answer := 'z' 


Crean. scheen 
write.string ("' Do you want to use the Link Evaluation Program?" ) 
-- validate answer 
once <r y AND vcanswer. <>. 'N")) 
new.line 
Peeoesening (! ivpe your choice (YY) or (N)") 
Keyboard ? answer 
capitalize (answer) 


Screen ! answer 
new. line 
woukee ahswer = 'Y' 
SEQ 


get.choices (Keyboard, Screen, construct, cpumode, run) 
-- send choices to other transputers 
PAR 

hostoutO ! construct; cpumode; repetition 
hostoutl ! construct, ecpumode; repetition 
! 
! 


hostout2 construct; cpumode; repetition 
hostout3 construct; cpumode; repetition 
-- executing the right procedure and prompting for new run 
Ing 
Conseruct = A oe 
inout.transfer (repetition, cpumode) 
GConstmucy = 5) 
byte.slice.transfer (repetition, cpumode) 
eens teue a= m 
int.transfer (repetition, cpumode) 
Cons acu = we a 
word.slice.transfer (repetition, cpumode) 
TRUE 
SLE 
-- prompt for another run and validate answer 
answer := 'Z' --- to make the next loop be executed 


WHILE ((answer <> 'Y') AND (answer <> 'N'! 
Syd 


write.string("Do you want another run? Type (Y) or (N)") 
Keyboard ? answer 
capitalize (answer) 


Seheoen Laitsw Gi 
new. line 
-- send answer to other transputers 
PAR 
hostoutO ! answer 
hostoutl ! answer 
hostout2 ! answer 
hostout3 ! answer 


eileen ,scCreen . 
write.string (" Thank you for using the Link Evaluation Program") 


Log 


new. oem 
Write 7s ewe bi Press reset on the BOOU—Rboarad to gew bcc) 
write.String (' (tor Vax, Vise eee 


AR 
IMS .BOO1.terminal.driver (Keyboard,Screen,port,baud) 
user. imterrace: 


wm KAKKKKRAKKRKRRAKRRKKKRKKKKRKKKKRKRKKKKKKKKKRKKKKKKRKAKRKKKRKARKKKKKKAAKRKKK 


--- END OF SGODE IN TRANS USa eo 
me KKKKKAKARAKAAKAKARARKRARKKKRAAKKKKKKKRKAKAKKKAKKKKKAKKKAKKKKKKARERKKAKK 
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Soe oe ULERO-BOO3.TDS 


paeeeenoc Erdnster0.b003 

== "PROC transfer0.b003 (CHAN in,out) 

PR@eeeransserO.b003 (CHAN in, out) = 

-- description 

--- This is the outer procedure placed on transputer 0 . It contains 
--- global variables and constants, and all procedures that run in this 
aumee oO USseurer. It receives 4g Construct type (ch), cpu mode (cpumode), 
--- and number of times each communication sequence (repetition), and 
--- calls accordingly one of the following procedures: 

--- “sor thans rere 

a eee yee 411 Ce.“thanssone, 

aaa = ant. transfert or 


Tas Pee Word.slice: trancrere 
mmm KKKKKKKKKARKAKKRKKRKRKARKKKRKKRKRKRRKKKKKKAKRKKKKKRKKKKKRKAKKKKKKKKKKAKKKKKRKKKK 


-- Link Definition 

DEF linkOin 

DEF linkOout 

Der Ainklin 

DEF linklout 

DEF link2in 

DEF link2out 

DEF lLink3in 

DEF link3out : 

== constant declarations 

Deweotzetanle = TABLE [ 1, 2,4, 8, 16, 32, 64,.128, 256, 512, 
1024, 1280, 2048, 4096, 8192, 10000 ]: 

DEF nr.of.sizes = 16: --- as counted from above table 

DEF maxbiock.size = 10000: --- last from the above table 

DEF maxwordblock.size = maxblock.size / 4: 


-- variables declarations 


WIN OH O10 


VAR ch : =~ SeneGlece OLeEne sel in Eyoe of construct 

VAR answer : "ST TGh@lCCEGHeENesUSer In CONtInlGus 

VAR cpumode_: saeeChotce Of wee Usciaein Golloperation concurrently 
VAR repetition: === choice of the user in number of times to run 


-- SC PROC cpubusysum (CHAN flagl, counterchan) 

-- PROC cpubusysum (CHAN flagl,counterchan) 

PROC cpubusysum (CHAN flagl,counterchan)= 

e= description 

mm KR KK KAKKKKKAKKKAKKAKKAK AAA AKA AKA KK AKARAAKAKAKAAKKRAAAKARAAKAAKAARKAKKR 
So ele Keeps the cpu working in pee (time sharing) with link 
=o thansters oe doing sum operations. It Stops when receives 

Ze cmiLad Oy tne seudine ls etagdmeemenm che orocedure transfer that 

eels Delngdsexecuted concurrently. 


= Outputs by channel counterchan number of operations done. 
oan KRRKAKKKAKKKAKRKARKEKKKRKRKEKKKKKAKKKRKKKRKRKAKKRKRKKRKKRKRKRKKRKKRAKRKRKAKKRKKRKKK 


Van a,b,¢, 
working, 
Soumter, 
En: 


SEQ 
counter 0 
working TRUE 
TIME ? a 
WHILE working 
ALt 
ede le en 
working := FALSE 
Tie 2 5 
SEQ 
e :=artb 
counter := counter + l 
Gouncecrenany — COUNTEL: 


-- descriptor 
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-- code 


-- SC PROC cpubusyprod (CHAN flagl,counterchan) 
-- PROC cpubusyprod (CHAN flagl,counterchan) 
PROC cpubusyprod (CHAN flagl,counterchan)= 


-- description 
KKK KKK REK KA KKKEKKRERK AA AK KEK KK RA KKERKKKK ARERR KKAKKKAKKKKKKKKKKKAAKE 


--- It keeps the cpu working in parallel(time sharing) with the link 
--- transfers ae doing ee multiplications. It stops when receives 
--- a flag by the channel flaqi= puometheenans 6 map noOcec timers cl cemans 
--- being executed concurrently. It outputs by channel counterchan 


--- the number of operations done. 
me RRR KKKKKRKKRRKKRRRRRRERARRRRRRRERERKRERARRERRRRRERERRKRRRRRRRRKKRKK 


== Gonstants and variable declaraerame 
DEF number = 100: ---= size of array 


VAR a[number + ty ---- array of integers 
bi{number + |, s"=-aunay Of Ineedems 
elnumber + 1), 2"==" dhrdy Of ance came 
CLOCK. ---- integer -variable to get time 
working, =-==- boolean -to Stopvexceuleonm 
Soca ---- integer -number of operations done 
Ghee: 


SEQ 
=-"inlvlalize fut ce momcmcin ciaeconec 
a 1 = { 1 FOR number 

EQ 
a 


a) aa 
bi San 
SKIP 

counter := 0 
working := TRUE 


WHILE working 
ALT 
f lagi en 
working := FALSE 
ca ET clock 


SEQ 1 = [2 FOR number] 
e[i] := afi] * b[i] 
counter ;= counter + number ---updatesi nr. of operations 
counterchan ! counter: 
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-- PROC inout.transferO (VALUE repetition, cpumode) 
PROC inout.transfer0 (VALUE repetition, cpumode)= 


Gescription 


mee KRXKKARAKKRRARKRRKRARRKRRKRKRKRRRRKRKRKRARKRRRKRRKRRKKKRARKKRKRRRKRRRARKRKK 


Seeeeldtld basen the Oucrensmanca executes the sprocedures lotransfer, 
== Sis, Wien applicable decording to ecpumode, one of the following: 
So Gpubusy. prod or Chips y.cum. 


Uses global constant maxblock.size 


wee KKKKKKRRKEKRKKRKKRRKRRRRKRKRARKRARARREKRRRKKRRKRKRRAKRKRKERERERRKARKRRKRKRAERKER 


CHAN flag, *=* flags, tie ep to stop . 
counter : --- return the number of operations cpu did 


-- PROC iotransferO (VALUE repetition, cpumode, CHAN flag, counter) 

PROC iotransfer0 (VALUE repetition,cpumode,CHAN done, opnumber)= 

peed ser lOtlon.10 

wane RKKAARARAKKARARKAKAKKAAKRKAAKKAAKAKKAAKAAAKKARAARKAARKAKARKARKAKRK RK 

Se +t exectites Sequenttally several parallel transfers using the 

=e= input/output Bao Ge, from trantsputer root. 
g 


--- It uses the obal constants sizetable, nr.of.sizes repetition 
mm RAKK RRR KAAKKKRERERRRERERRRERERRERERERERRAKERERERERERERERRKRRERE 


-- variable declarations 
Verne tock. Size, 
aa --- the number of operations done by the cpu. 
c : 
VAR buffer0 fe maxblock.size + ae 
VAR bufferl [BYTE maxblock.size + 1]: 
SEO _ ee 
Peer relatize: Durrers 
SEQ i = [1 FOR maxblock.size}] 


SE 
uffterO-By tears i\8 
bufferl-BYTE i- 1\8 

Sle 


oo = [0 FOR nr.of.sizes] 


lock.size := See ae ta 
-- input and output handling 
= ERDUL. ou Cimmonle selenimae 
SEQ j = [1 FOR repetition] 
SEQ 
ot a ee 
SEQ k = [1 FOR block.size] 
Tee to Wee tO) |B ies eek) 

Solr 


-- input/output to/from one link 
SEQ » = [1 FOR repetition] 
SE 


Outi a 
SEQ k = [1 FOR block.size] 
PAR 


ieee our rere lo. TE S] 
out ! bUEFer=|BYTE k} 
SKIP 


ae Input § rome t wou Granmne.s 
SEQ } = [1 FOR repetition] 
SEQ 


GWE! at 


SEQ k = [HMSOR block osize) 
in ? bUPErerO [EQPEa ik 
Shir 


-- input/output to two links 
SEQ j = [1 FOR repetition] 


Gut “!-Va" 
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oEee = {1 FOR block.size] 
AR 


in) 2eoe fer 0) Sauber 
Gut ! butter lL isy Fee 
SKIP 


-~ input from three channels 
SEQ 2 {1 FOR repetition] 


DUE was 
Sa) k = {1 FOR block.size] 
in ¢ buffero| BYTE «| 
She 


-- input/output to three links 
BED } = [1 FOR repetition] 


EQ 
Out lees 
SEQ k = [1 FOR block.size] 
PAR 
Out ebusrend (Evi cms 
he Go chbbete el [ssciya, ae] 
Sint E 


-- input from four channels 
SE a = {1 FOR repetition] 
SEQ 


OU eer é 
SEO k = [1 FOR block.size] 
in -? buppere (BYTE i) 
one 


== ae to four channels 
E {1 FOR repetition] 


Out!) va 
SEQ k = 2 FOR block.size| 
Out !  bureere eYTE sls) 
SKIP 


[al UP OU DU tr cdic tap MGE Ine p aia lace 
SE oH = {1 FOR repetition] 


ait te cle . 
SEQ k = [1 FOR block.size] 
EAR 
in ? bufferO[BYTE k 
out ! bufferl[BYTE k] 


SKI 
SKIP 
IF | 
-- cpumode not = '0' then get the number of computations done 
cpumode <> '0' 
SEQ 
flag ! 'a' === flag iO) stCemeroceaume coubisy 
opnumber ? number --- receiving computations from cpu 
out ! number --- sending computations to transputer roo 
TRUE 
SKE 


-- main PROC inout.transfer0 
epee = '1') OR (cpumode = '2')) 
eee eae eee pepe oa cpumode, flag, counter) 
cpubusysum counter 
((epumede = a OR a oeeeene = '4')) 
AR 
“iotransfer0 ep ey cpumode, flag, counter) 


cpubusysum ( , NEOtmce L 
((cpumode = '5') OR (onion = '6')) 
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PAR 
eters erg Peon cpumode, flag, counter) 
cpubusyprod (flag, eee 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR _ 
iotransferO eee ee cpumode, flag, counter) 


cpubusyprod (flag, counter 
i RUE 


lotransferO (repetition, cpumode, flag, counter): 
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-- PROC byte.slice.transferO (VALUE repetition, cpumode) 
PROC byte.slice.transfer0 (VALUE repetition, cpumode)= 
== descrip elon 


mn RK KKARRARKERAKAKKAKAAKAKAAKAAKAKAARKARRAK KKK AK AAKKRAKAKAKAKEKKEKKKKAKE 


--- Initializes the buffers and executes ene spneccaumosseransren: 
--- plus, when applicable according to cpumode, one of the following: 
2S cpubusy.prod or cpubusy.sum. 

-- Uses global constant maxdlock size 


mmm KKKKKRAKRKKKAKKKKAKKKKKRRARKRRKRRKRKKKRKAKKRRKRKKKKKRRRKRRKKRKKRKKKRRKEKKRKRKK 


CHAN “flag, --- flags the cpu to stop 
counter : --- return the number of operations cpu did 


-- PROC transferO (VALUE repetition, cpumode, CHAN flag, counter) 
PROC transferO (VALUE repetition, cpumode,CHAN done, opnumber)= 

<= description 

ma- RAKE 

--- It executes sequentially several parallel transfers using the 
--- BYTE.SLICE procedures to) thom tuansoutemarooer. 


KRAERKKKKKKKKKKKKKAKKKKAKKKEKKKKKAKKRKKRKAKKAKKKKRKKKAKKKRKKKKAKKEK 


--- It uses the global constants sizetable, nr.of.sizes, repetition 
mmm KKAKAKRKAKK KERR AAA ARARARKRKRKR KAKA RARARARARRERERARARARKRRRRARKRKRK 


-- variable declarations 

VAR block.size, 
Band --- the number of operations done by the cpu. 
Cc : 

VAR bufferO Eo maxblock.size + a 

VAR bufferl [BYTE maxblock.size + 1 


SEQ 
-- initialize buffers 
SEQ i = [1 FOR maxblock.size] 
SE 
uffer0-BYTE i- i\8 
bufferl-BYTE i- TVS 


Se re 
aeu i = [0 FOR nr.of.sizes] 
E 


lock.size := Bee Lo ace 
== input and output handling 
-- input from one channe 
sae, = [1 FOR repetition] 

E 


OUE. (ta. 
i BYTE .SLICE.INPUT(in,buffer0,1,block.size) 
Ske 


== 1hnput/ OUEDUE SEO Onecmenanmer 
SEQ 3 = [1 FOR repetition] 
SEQ 


out ! ‘a! 
PAR , 
BYTE.SLICE. INPUT (in burtend ya, blocks sze) 
ae BYTE.SLICE.OQUTPUT(OUL buffer oloGt.cize) 
EB 


-- input from two channels 
SEQ 3 = [1 FOR repetition] 
SEQ 


QUE TI) ay 
BYTE.SLICE.INPUT(in,buffer0,1,block.size) 
SKIP 


“= input/output to two channels 
SEQ 3 = [1 FOR repetition] 
SEQ 
Oliiw to as 
PAR | 
BYTE. SLIGE.INPUI (in, butten0e ie lec cize)) 
BYTE.SLICE.OUTPUT(out,bufferl,1,block.size) 
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SIN DE 


-- input from three channels 
SEO 7) = [1 FOR repetition) 


SEQ 

CUtw as . 
Z BYTE SRI eh EUG Ine ourferO.) block.size ) 
KIP 


-- input/output to three channels 
ae [1 FOR repetition] 
E 


OU seas 
PAR 
Bree Sel CheemNEUL( in bufferO 1 block.size) 
oe Bolero ech ouleut Out, bUfferl —l) bDlock.size ) 


-> input from four channels 
oe = [1 FOR repetition] 


Olu a! 
SKIP BYTE .SLICE.INPUT(in,buffer0,1,block.size) 


eaOucpuc CO foun channels 
os = [1 FOR repetition] 


out ! ‘a' 
: BYTE.SLICE .OUTPUT(out ,buffer0,1,block.size) 
KIP 


pao OuleGMERanG Input an parallel 
SEQ 7 — | USEGR repetition] 
SEQ 
OWt. ta’ 
PAR 
BY ie. oLlGEaliNrUa{insoutterO" | pblock.size) 

a Bates @liGs,OUlbUt (curt putter] ,1,block.size) 
3 


SCL 
TF 
-- cpumode not = '0' then get the number of computations done. 
cpumode <> 'Q' 
SEQ 
Flag sieiear 
opnumber ? number 
out ! number 


TRUE 
Shir 


-- main PROC byte.slice.transfer0 
I 
pee uede = '1') OR (cpumode = !2')) 
AR 


transferO (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
(({cpumode = '3') OR (cpumode = '4')) 
ee ee 


transferO (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
((cpumode = '5') OR (cpumode = '6')) 
Pak 


transferO (repetition, cpumode, flag, counter) 
coubusyprod (flag, counter) 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR . 
transferO (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE a 
transferO (repetition, cpumode, flag, counter): 
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-- PROC int.transferO (VALUE repetition, cpumode) 

PROC int.transferO (VALUE repetition, cpumode)= 

-- description. - . 

mn RRKKKARAKKKK KK KKK KAARKARARKRKAKRAKAAKRKRKEKAAARKEK AA AKARAKKKAKRAKKKKKERKAA 
--- Initializes the buffers and executes the procedures intransfer, 
--- plus, when applicable according to cpumode, one of the following: 
--- cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxwordblock.size 
KR ERK ERR REKRRKERRRKERRERKR RRR RRERERERRRRKRKKKEKKRKEKARKAKKRERKKAKERKER 


CHAN flag, -==- flags the ecpumvoe-uop 
counter : --- return the number of operations cpu did 


-- PROC intransfer0) (VARUB repetsttengmcemocdemmenaN flag seouncecrs 
ane intransferO (VALUE repetition,cpumode,CHAN done, opnumber)= 
-- description 
mann KRERKKRRRKKKKRERKKKKKKKARRR KK AKERAKKK RE REKKKRRARAKKRERAKEKRRERKKK 
--- [Tt executes tiequentially cevenaiepayamlet=ememcters Wsiiagecas 
aos ape Ea Eeue pean ee to/from transputer root. 

e 


== -) PP eeusecuse g obal constants sizetable, nr.of.sizes, repetition 
mann KRKKKKERKKKKERKKKARRKARKRARKRRRKKRKKRKRARRRRKKRKR ARR RRERARKERRRRKR 


-- variable declarations 

VAR Bieeios ize, . 
Ba --- the number of operations done by the cpu. 
¢ $ 

VAR woufferO [maxwordblock.size + 1]: 

VAR wbufferl [maxwordblock.size + 1]: 


SEQ 
o> Initial icemoemmeis 
SEH = [1 FOR maxwordblock.size] 


Wpuggerol 4 
woufferl|1 
Pe 


. 
1h 


oes = [0 FOR nr.of.sizes] 
i 


sizetable[{i] < 4 . 
aise ---minimum number of bytes is 4 for integer transfer 
U 


on 
ieee. size := Se a ean 
=> Inpub and Output andiang 
-- input from one channel 
SEQ oh = {1 FOR repetition] 
5 


Owe! ode 
SEQ k = {1 FOR gee tc a 
in ? wbhuffer0O[k] 
Sickle 


== input) Cub ou Gee cmron masa 
SEQ j = [1 FOR repetition] 
SEQ 


Out as 
ae = [1 FOR (block.size/4) ] 
AR 


in ? wbufferO[k 
out ! wbhufferl[k] 
Se 


-- input from two channels 
SEQ j = [1 FOR repetition] 
SEQ 
OUE cle 
SEQ k = [1 FOR paecinss2 e/a 
in ? wbufferO[k] 
SKIP 
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S=e2npwe/ Cutput to two links 
SEQ aa [1 FOR repetition] 
ou, f'a; 
SEQ k = [1 FOR (block.size/4) ] 
PAR 


nie putter ark) 
out ! wbufferl[k] 
SKIP 


-- input from three channels 
SEQ oy [1 FOR repetition] 


Ol Cee ; 
SEQ k = [1 FOR aaa 
in ? wbuffer0O[k] 
SKIP 


-- input/output to three links 
EEO" 7 [1 FOR repetition] 

SEQ 

OUGr. a. 


[1 FOR (block.size/4) 


if 2 pe eos 
out ! wbufferl[k] 
SK EP 


~= ae from four channels 
SE = [1 FOR repetition] 
EQ 
CUE to a. 
SEQ k = [1 FOR (block.size/4) ] 
in ? wbuffer0O[k] 
SKLP 


PoeoUePpUes oOo) LOUR channels 
ae 2 % 


[1 FOR repetition] 
oats ca. 
SEQ k = 


[1 FOR (block.size/4) ] 
out} “woutterO|[k] 
STP 


we All outpue and amputlin parallel 
ae } = [1 FOR repetition 


Yr va 
320 = [1 FOR (block.size/4) ] 
AR 
in ? wbuffer0O[ Le] 
me out ! woutferi([k} 
SKIP 
SKIP 
ish 


-- cpumode not 
epics CoO 


flagal2 tae 
opnumber ? number 


'O' then get the number of computations done 
out ! number 


TRUE 
SREP 


ae main PROC int.transfer0 
F 
((cpumode = 
PAR 


'1') OR (cpumode 


intransfer0O { 
cpubusysum ( 


= '2')) 
repetition, cpumode, flag, counter) 
Fach, counter 
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({cpumode = '3!') OR (cpumode = '4')) 
PRI PAR a 
intransfer0O ir epee cpumode, flag, counter) 
cpubusysum (flag, counter) 
((cpumode = '5!') OR (cpumode = '6')) 
PAR 


intransfer0O eee cpumode, flag, counter) 
cpubusyprod (flag, Somes 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR - 
intransfer0O jrepet ley cpumode, flag, counter) 
see cpubusyprod (flag, counter 
R 


intransferO (repetition, cpumode, flag, counter): 


-- PROC word.slice.transfer0 (VALUE repetition, cpumode) 

PROC word.slice.transfer0O (VALUE repetition, cpumode)= 

aameosenlOCLOn. 

mae RARKARKARKARKAKKRARKAAKKAAKAKRAKKARRAKRAKERK RAKAAAAKRAKAAKAAKAKKRRKK 
=-— Initializes the buffers and executes the procedures wordtransfer, 
Seep lus, wen spplicablesaccording to cpumode, one of the following: 
=== cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxwordblock.size 
mmm RAKKKARAARKKAAAAKKKAKKAARAKRA KAKA KKK RARKKRAKERARAAKKAAKARKAKERARKAKKK A 


enAN “flag, et Lags ene cow tO stop 
SOtntererman return thea number Of operations cpu did 


~~ PROC wordtransferO (VALUE repetition, cpumode, CHAN flag,...) 
PROC wordtransferO (VALUE repetition,cpumode,CHAN done, opnumber)= 
= Cescilouren 
mee REKRKAAKARKAAKAARKAARKKAAKAKAKAAKARAKAARKAR AAR ARARRR AAR RAKERARAEKKE 
Sone 6 XeGuices SeaueneanLy several parallel transfers using the 
So aeue nD) ole en Dhecedimes  )to/ from transputer root. . on 
Taree CwuUses ene global constants sizetabie, nr.of.sizes, repetition 
RRARRKKAKKARKRERAKRRRRRAKAKARARARARKRKARKKRKARARAKER 


eee eg eee, 


Sate MARA KRKRRARRRK 


-- variable declarations 

NAB eel. S127, 
aaa --- the number of operations done by the cpu. 
Cc : 

VAR wbuffer0 maxwordblock.size + 1]: 

VAR wbufferl [| maxwordblock.size + 1 


SE 
== initialize buffers 
ee = [1 FOR maxwordblock.size] 
E 


‘pugferdf i 
woufferl/1 


' 
1 


SHOo 1 = 10 FOR nr.of.sizes|] 
ce 


sizetable[i] < 4 
SKIP ---minimum number of bytes is 4 for integer transfer 
TRUE 


SE 

ein aiive := ee ea 
== ThiplieaMcsol Lot nancdling 
ae omc mone =cilanimie st 
aD [1 FOR repetition] 


out ta! 
eee WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 


Se enouy OUEDUL GO one Tank 
aE {1 FOR repetition] 


Ole.) ha: 
PAR 
WORD.SLICE.INPUT(in,wbuffer0,1,(block.size/4) ) 
= WORD.SLICE.OUTPUT(out,wbufferl,1,(block.size/4) ) 


== anput from two channels 
See 3 = {1 FOR repetition] 
EQ 


out boa: ; 
WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
SKIP 


Ta LnpU Ee OUEDUE COmewOuLinixs 
ee {1 FOR repetition] 


rot 


Olle ase 
PAR 
WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4)) 
a WORD.SLICE.OUTPUT (out, whuUrter i 1 (plock-size/aam 
Shee 


== Inpule Pir ommentee selene s 
SEQ 3 = [1 FOR repetition] 
SEQ 


tc 1 ral 


out 
- WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
Sree 


sa INPUE/OMEDUL CO tnnec links 
SEQ 3 = [1 FOR repetition] 
SEQ 
out ! ‘a! 
PAR | 
WORD.SLICE.INPUT(in,wbuffer0,1,(block.size/4) ) 
ef WORD.SLICE.OUTPUT(out,wbufferl,1,(block.size/4) ) 
SN LP 


-- input from four channels 
SE j = [1 FOR repetition] 
EQ 


Out. Joa! 
ms WORD.SLICE.INPUT(in,wbuffer0,1,(block.size/4)) 
Sh LE 


-- OQUEDUE EO RnoulT ellanieds 
SEQ 3 = [1 FOR repetition] 
SEQ 


outs!) Ga: 
WORD.SLICE.OUTPUT(out,wbuffer0,1,(block.size/4) ) 
SKIP 


== all oWEpuL and anpue ne panaeve:: 
SEQ = [1 FOR repetition] 
EQ 


out ! a? 


WORD.SLICE.INPUT(in ,wbuffer0,1, (block.size/4) ) 
a WORD.SLICE.OUTPUT (out ,wbufferl,1,(block.size/¢) ) 
Shae 


ae 
~~ cpumode not = '0' then get the number of computations done. 
cpumoede <>) 70) 
SEO 
flag ! ‘a' 
opnumber ? number 
out ! number 


TRUE 
Sr 


se main PROC word.slice.transfer0 
eee = '1') OR (cpumode = '2')) 

ie 
wordtransferO (repetition 


cpubusysum (flag, counter} 
vecpumegs = '3') OR (cpumode = '4')) 
RI PAR 


cpumode, flag, counter) 


wordtransfer0 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
(cp une = '5') OR (cpumode = '6')) 
A 


wordtransferO (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR 


Io2 


wordtransfer0O (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 

TRUE -_ 

wordtransferO (repetition, cpumode, flag, counter): 


==eorocedure body transfer0d.b003 


SEG 
answer := 'yY'! 
moeHilLE answer = 'Y! 
SEO 
cree Gs) 


ie cPUn cre. 
ims epee clon 
er 
eae h) — 
eee sn er (repetition, cpumode) 
cn = 
byte.slice.transferO (repetition, cpumode) 
ch = 'T! 
eee (repetition, cpumode) 
Cua = 
word.slice.transfer0O (repetition, cpumode) 
BRU 
| SKIP 
teens vey. 
mm KKKKKAKAKARKARKAKKAAKAKKAKAKAKAKAKAKAKAKKAKAKKAKAKAKAKAKAKKKKKKARKARKAR AKA 


pap Sie Or CODE IN TRANSPUTER 0 BOOS 


los 


~~ TRANSPULER eS over res 


== SC PROG transterieboge 

-- PROC transfer] .5003. (CHA rau 

PROC transferl.b003 (CHAN in,out) = 

== )C2SCrIDELOM = 

ma RRR ARR AR AKAAKARAKRRAR ARR ARR AR AA RAK KAA RAK AK K AAR RAK AR AK KARR AA RAKR AAR 
~~~ This is the outer procedure placed ommeransputen sie | mee cmeca 
--- global variables and constants, and all proceduvecmena ead iies 
--- transputer. It receives a construct type (ch), cpu mode (cpumode), 
--- and number of times each communication sequence (repetition), and 
--- calls accordingly one of the following procedures: 

--- - lo.transferl, 

--- - byte. slice. trans tem 

=== woah Ey Sr dhicime ts ot, 


--- - word.slice.transferl 
mmm KKKKKKKAKKKKKAKERRARER EKER RRR RARER RARER EAA AAA RRRARRERKRREKRR KKK RK KKK 


-- Link Definition 
DEF linkOin 
DEF linkOout 
DEF linklin 
DEF linklout 
DEF link2in 
Dee JinkZoue 
DEF link3in 
DSF Linkcour 


-- constant declarations 
DEF sizetable = TABLE [ 1, 2, 4, 8 


W~YTN Or Wo 


LO 332, 0¢ 120; Zoo mou. 


1024, 1280, 2048, 4096, 8192, 10000 |: 
DEF nr.of.sizes = 16: --- as counted from above table 
DEF maxblock.size = 10000: --- last from the above table 


DEF maxwordblock.size = maxblock.size/4: 
-- variable declarations 


VAR ch : --- choice of the user in type of construct 

VAR answer { === [ehoree of Che User Ineconetnue sas 

VAR cpumode_ : --= choice of the user in cpu operation concimmcmimn 
VAR repetition: --=- choice of the user in number of times to run 


-- SC PROC cpubusysum (CHAN flagl, counterchan) 

-- PROC cpubusysum (CHAN flagl,counterchan) 

PROC cpubusysum (CHAN flagl,counterchan)= 

== desemicelon 

ma RRR RRR KARAR KARA AKRAK ARK AK AK AKA KARA ARR RAR AK A RRR RR AK 
--- It keeps the cpu working in parallel (time sharing) with link 
--- transfers Oe doing sum operations . It Stops when it receives 
--- a flag by the channel flag! from thewerans (ema opoce aise mC nace 
--- being execuited concurrently. [te Outeuts soyvechanne Seco WnceigelTor 


--- the number of operations done. 
mK KKK KEK KR KKK KARE KEKE KE RARKKKKREKRRKKRRRKKKKRRKRKKKKRRKKKKERK 


VAR ayo, ey 
working, 
counter, 
ch : 


SEQ 
counvets 0 
working TRUE 
TE ce 
WHILE working 
ALT 
foccHian cen 
working := FALSE 
TIME as 
SEQ 
e := a+b 
Counter :=seoumeer ss ol 
counterchan ! counter: 
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-- descriptor 
EC O§e 


== SC PROC cpubusyprod (CHAN flagl,counterchan) 

-- PROC cpubusvprod (CHAN flagl,counterchan) 

PROC cpubusyprod (CHAN flagl,counterchan)= 

peeaeserlotlon 

man K KKK KKKRKAKAKARKAAAKARKAARAAKAKARERARKAR AKA RRAAARARRKKRAK AREER RARKARAKARARK 
--- It keeps the cpu working in parallel(time sharing) with the link 
--- transfers ee ieiiguawaa aiuEMouieae tons. It steps wien receives 
--- a flag by the channel flagl from the transfer procedure, that is 
Some lie —exeCubeGueecncuUrrentiy-. It jautputs by channel counterchan 
--- the number of operations done. 

mane RRRKKARAARKAEKKR RAKE RK RER AKA RAKAKARAKAKKRKRAA KAR AA RERRKKRK KR RKRA KR 


-- constants and variable declarations 


DEF number = 100 ==-= size of array 

VAR a[number + 1] ---- array of integers 
b[number + iy, ---- array of integers 
Sunetent “media yeor lntegesss 
CLOCK, J eiieedene—vaniasle to get time 
working, j= HOCH EO SLOP execution 
counter, ---- integer -number of operations done 


en 


SEQ 
-- initialize buffers and variables 
SEQ D [ 1 FOR number |] 
Sy 


afi] ss 3th 
ae 
Soe 
Sounter 
working TRUE 
WHILE working 
ALT 
Plagie 2 cn 


working := FALSE 
ive: Clock 
Se 


SEQ 1 = [1 FOR number] 

e{[i] := a[i] * b[i] 

counter := counter + number ---updates nr. of operations 
counterchan ! counter: 


Wil 
S 
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-- PROC inout.transferl (VALUE repetition, cpumode) 
PROC inout.transferl (VALUE repetition, cpumode)= 


-- description 
mn KERAKRARKKAKKAAKRKA RK AAKAAKKAKAKAKRARAKRARKARARAKRA RA AKKKAAKKKKKKKKKK 


--- Initializes the buffers and executes (tne procedines torranstern, 
--- plus, when applicable according to cpumode, one of the following: 
--- cpubusy prod oeepeubus cum: 


-- Uses global constant maxblock.size 
ARREKR AKA ARREK ARR KEK KKK RRR RKKR ARR KKKRRRRRKAKK AA RAAAAR RR RAK ARERR 


CHAN flag, “== flags the cpu to stop 
counter : --- return the number of operations cpu did 


-- PROC iotransferl (VALUE repetition,cpumode,CHAN done,opnumber) 

we lotransferl (VALUE repetition,cpumcde, CHAN done, opnumber)= 

[= CeCSCHTp elem 

nn RRR KRRRAKAR RK IAK AAR AA LR RAE Re RA 

--- It executes sequentially severally pana ile emamctgeneys ici tl cmmenre 

-.- A OPO CUll Seg eas to/from transputer root. 
e 


--- It uses t g obal constants: sizetable, nr.of.sizes, repetition 
wm RRKKKRKKRKARERRERERRRRRRRRRRRRRRRERRRRERRRARRRKRERRRRRRRARRRRRRRRE 


-- variable declarations 

VAR block.size, 
a --- the number of operations done by the cpu. 
Cc : 

VAR buffer0 ine maxblock.size + 1]: 

VAR bufferl [BYTE maxblock.size + 1]: 


OE 
_ Initialize buffers 
SEQ i = [1 FOR maxblock.size] 
OF 
upfeRnO-E tees 1\e 
bufferl-BYTE 1- iS 


Sir 
ee = [0 FOR nr.of.sizes] 
E 


lock.size := Steere 

=— Input and ou tpulmeiarnclins 

=> input from two enannels 

SEQ j3 = [1 FOR repetition] 
SEQ 


out ! ‘a! 
SEQ k = [1 FOR block.size] 
ie oo buarerO (BC laek 
SKIP 


=> INpUE OULU Le COmty om mics 
= [1 FOR repetition] 


OUte. ae 
Seon = [1 FOR block.size] 
AR 


in ?ebutferd | bys i 
Out but rene ieee 
Sik GF 


ao IN PUGe Grom enisec seltaiiniedrs 
SEQ j3 = [1 FOR repetition] 
SEQ 
Cute! aa. 
SEQ k = [1 FOR block.size] 
Tie ae ue hemor yer kT 
SKIE 
= Thpuc/OULoUte EOmelnec = inias 
SEQ j = [1 FOR repetition] 
SEO 
OU Epa 
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SEOuKe— oll ROR block size] 


AR 
iiewetrterO Byte k 
out tf bufferl[BYeE k] 
SKIF 


sel nea: Imeem four schanne ls 
SEQ ah [1 FOR repetition] 


Cut 1) ta! 
SEO k = [1 FOR block.size] 
Hele) Jo bbesc =a) eyaua k] 
Sb 


Ta OUCDULEEO  rOUr Ghannels 
SEQ Lane [1 FOR repetition] 


eur | Ve! 
SEQ k = = FOR block.size] 
eae fou. fero (BYTE «Kk | 
See 


-- all output and input in parallel 
SEQ Sh [1 FOR repetition] 


OUtye ae 
So = [1 FOR block.size] 


in ? bufferO[BYTE on 
Cut ! buffterl | BYTE 
SKIP 


Sr 
a 
-- cpumode NOT = '0' then get the number of computations done. 
cpumode <> '0Q' 
920 
done ! ‘a! 
opnumber ? number 


TRUE 
Sle 


==—MalnePROG INOUt. transiers 


F 
(Ceeunede = '1.') OR Ccoumede = '2")) 
“iotransferl repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
¢ cpamp ge, = '3') OR (cpumode = '4')) 


eeieerens fer ao dee oy cpumode, flag, counter) 
cpubusysum (£1 counter) 
Seo ees = 151) OR aie aineres 2. = "6 )) 


eae are rey i ieee cpumode, flag, counter) 
EE uavErCG flag, counter 
aNepamode e = '7') OR (cpumode = '8')) 


Neer sna fer Mee En umode, flag, counter) 
cpubusyprod (flag, mauncen) 
TRUE 
iotransferl (repetition, cpumode, flag, counter): 


isy 


-- PROC byte.slice.transferl (VALUE repetition,cpumode) 

PROG byte sli cémeeanstemes (VALUE repetition, cpumode)= 

-- description 

mann RRAARRAKKRAKAKKAAKAKAKAAKEKARKKAKARRAKA AAA AKARAAARARRAKKKKKKRRKKKKKKKR 
--- Initializes the buffers and executes the procedures transfer, 

--- plus, when applicable according to cpumode, one of the following: 
ae cpubusy .prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
mann KARKKRAARARKAAKAAKAREKARARKARKARAARAARKARAAAAARARAAARARAAKARARKREKKKK 


CHAN flag, --= “fla@s stie cplrors sop 
countier : -=- retlirn the number cf operations cpu did 


-- PROC transferl(VALUE repetition, cpumode, CHAN done, opnumber) 

Das Eas tenet (VALUE repetition,cpumode,CHAN done, opnumber)= 

== deschlweren 

mm KAKKKEKKAKKAKKAKKAKKKKRAKK AK KKKAKRKEKRKKRK KK KKAKKERAKRKAAKARKAKKAKKRKKKKK 

--- It executes sequentially several parallel transfers using the 

--- BYTE. SLICE proceduirese io) montreal s piece umtee cies . 

--- It uses the global Constants: "sae eae termina omesdces, 

=== repetition 
KAKKKAKKKAKAKKKKAKKKKAKKKKAKKERKKRKKKKAKAAKKAKAKAKAKAKKAKKKKKKKKKE 


-- variable declarations 

VAR block.size, 
oeaa --- the number of operations done by the cpu. 
eG : 

VAR buffer0 eae maxblock.size + ae 

VAR bufferl [BYTE maxblock.size + 1 


SEQ 
SSinitia les oteae tes . 
SEQ 1 = [1 FOR maxblock.size] 


SE 
buf fer0-BYTE ae i\8 
bufferl-BYTE i- i\8 


Se 
SEQ 1 = [0 FOR nr.of.sizes|] 
SE 


lock.sizeé := Ste eee 
== Input alae CUED Umenanalenng 
== input from two Clammeds 
ore = [1 FOR repetition] 


Ollceniis ae 
er BYTE.SLICE.INPUT(in,buffer0,1,block.size) 
Ke 


== input/output to two channels 
SEQ 3} = [1 FOR repetition] 
SEQ 


Out bela! 
PAR | | 
BYTE.SLICE.INPUT(in,buffer0,1,block.size) 
BYTE .SLICE .OUTPUT (out ,bufferl,1,block.size) 
SKIP 


== inpiit from Chreemenodie 
SEQ j = [1 FOR repetition] 
SEQ 
Ou a 
BYTE.SLICE.INPUT(in,buffer0,1,block.size) 


== iInpUt/ OULSUISEEOr cise cmetanme tls 
SEe } = [1 FOR repetition] 
EQ 


Cult ©! Sea 


PAR 
BYTE.SLICE.INPUT(in,buffer0,1,block.size) 
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ree EVlecler  OUIFURGCOUE, burteril, Il block.size) 


-- input from four channels 
SEQ j = [1 FOR repetition] 
520 ne 
uel 


out ! 
ae BYTE chICH Jil (inesurterO,1 block. size) 


== OUtpuE to folly channels 
See 2 = [1 FOR repetition] 
SEQ 


OUuceE Tas 
ut BYIE SLIGH OUIPUI oun buttrer0,1 block.size) 


Sorat) OUCPEL ana @npucetn parallel 
eee 3 = [1 FOR repetition] 


on Y tq! 
PAR 
BY oe. SULCE,, UN DUM eMme bur fer ,.1 block. size) 
ae Babee SUCH OUTPUT (out, BEEEOCY. HP SrOCKk. size) 


SKIP 
IF 


-- cpumode not = '0' then get the number of computations done. 
cpumode <> '0Q' 
SEQ 
done ! ‘'a' 
opnumber ? number 


TRUE 
SKIP 


-- main byte.slice.transferl 
eopanode = '1') OR (cpumode = !'2')) 


transferl (repetition, cpumode, flag, counter) 
cpubusysum (f a9 counter 
Cc 


((cpumode = '3') OR (cpumode = '4')) 
Pal Par 
transferl (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter) 


Miceaned: =p Si) OR cpumode = '6')) 


ge accel repent en cpumode, flag, counter) 
cpubusypro aan counter) 
(,cpumode = '7!) OR (cpumode = '8')) 
PR ra 
transferl (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE 
transferl (repetition, cpumode, flag, counter): 


-- PROC int. transrerl (VALUE reper triomcounodes 

PROC int.transferl (VALUE repetition, cpumode)= 

== Cesoutecron 

mann KERKKAAKRKKAKKKKRAKARAAKAKAK EKER KK RARARKARRAARKARK AKA AKKKKKAKKKAKAAKAK 
--- Initializes the buffers and executes the procedures intransfer, 
--- plus, when applicable according to cpumode, one of the following: 
--- Ce Ue Ue yeuues or cpubusy.sum. 


-- Uses global constant maxblock.size 
mann KKRKARAAKKAAKKKAKKKAKRKAKAAARAAARARKAKKAKRKAKAKAKAAAKAKARKAKKKAARKKAKK 
CHAN flag, ~== flags the coustomston 
counter : --- return the number of operations cpu did 
-- PROC intransferl (VALUE repetition, cpumode, CHAN done, oe 
PROC intransferl (VALUE repetition,cpumode,CHAN done, opnumber)= 
== Cesc hipielen 


mame KKKKKKRKKAKRKKKKARKAKKKRKKKKKRKKRKAKRKKRKRKKRKRKRKRRRRKRRKRKRKRKRKAKRKAKRKRKKKKRRKKK 


--- Tt executes sequentrdilly se Vena pee ay re ec aes more een copra 
--- using the ADR OE ou e ue PRIMNLeL Ves seo mene eadaseute tt Hook. 
--- It uses the global constants: sizetable, nr.of.sizes, 


--- repetition 
mann KREKKKKKKARKKKAKARKAKKKKKKARKKKARKRKKKKKKKARKAKKAKARKKKKKKKKKKAK 


-- variable declarations 

VARTOOck. suze, . 
Bana --- the number of operations done by the cpu. 
C : 

VAR wbufferO [maxwordblock.size + 1]: 

VAR wbufferl [{maxwordblock.size + 1]: 


SEQ 
mie) Itidvedict 2 yo Ura . 
ee = [1 FOR maxwordblock.size] 
E 


Y pufferd i 
woufferl[i 
P 


: 
aL 


e e 


oe = [0 FOR nr.of.sizes] 
jet 


sizetable[i] < 4 | 
SKIP ---minimum number of bytes is 4 for integer transfer 
TRUE 
SEQ | | | 
block.size := oo 
=> Input and goutput. hanging 
== Input SRom twouchanneus 
SEQ 3 = [1 FOR repetition] 
oe bat 
ac 


out 

SEQ k = {1 FOR eae 
in ? wbufferO[k] 

SKIP 


== iInput/ OULpU ceuomeW Ommetniics 
SEQ = [1 FOR repetition] 
Q 


OU Sade 
a = [1 FOR (block.size/4) | 
AR 


ine WEEE Ses 
out ! wbufferl[k] 
SKIP 


== input from thvee channels 
SEQ j = [1 FOR repetition] 
SEQ 


CUES a 
SEO k= [1 3ReR Ceara)! 
in ? wbufferO[k] 
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aa nee 


Sue 


= pte Orie iemco tibee minis 
ae 2 = {1 FOR repetition] 
SEQ 


out ! "at 
SEO k = [1 FOR (block.size/4) ] 
PAR 


in ? eee ee 
Out! woutferi| ic) 
Sis 


== input from four channels 
SEQ a [1 FOR repetition] 


Cue 1 ya. 
SHON co mel POR ee co 2?) 
in ? woufferO[k] 
SNE 


== CUCpUC Been -elliq=chnanne.s 
eae = {1 FOR repetition] 


Cue! ae 
SEQ k = [1 FOR (block.size/4)] 
eee out ! wbufferO[k] 


Sod CUE pEEoncernpiiesn paral lew 
or Q 2 = {1 FOR repetition] 
SEQ 


Stl tesiey a: 
SEQ ke = [1 FOR (block.size/4) ] 


i eee eee. 
out ! wbufferl[k] 


Siw le 
Ske 
IF 
-- cpumode not = '0' then get the number of computations done. 
epiimode <> '0' 
SE 
done ! ‘a! 
opnumber ? number 
TRUE 
Sy lg 


-- main PROC int.transferl 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


intransferl (epee eno, cpumode, flag, counter) 
cpubusysum (flag, counter 
cccoumede, = '3') OR (cpumode = '4')) 


"intransfer1 epee atro Goumnecemmr lag, counter) 
cpubusysum ( counter) 
apunode =") OR a eeeccus = "6%))) 


mriRtransfer! ee eee umode, flag, counter) 
coubusyprod (flag, Beda cer) 
ee rade = '7') OR (epumode = 958")) 


intransferl epeeraenc umode, flag, counter) 
cpubusyprod (flag, See 
TRUE 
intransferl (repetition, cpumode, flag, counter): 
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-- PROC word.slice.transferl (VALUE repetition, cpumode) 

PROC word.slice.transferl (VALUE repetition, cpumode)= 

== Ceo G ee enenl 

mann RAAARRKARKAKRKKKAKKKKAKKRKAKARARRAARARAREKR KAKA RAR ARK RKAKKKRRKRKKKKK 
--- Initializes the buffers and executes the procedures wordtransfer, 
--- plus, when applicable according to cpumode, one of the following: 
--- cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
mm RAKAARKRAAKEKRAKARAAKARKAKRARKKR RA RRAKAKRARKARKAARAKARKAKRARKARKRARARKKKK 


CHAN flag, --- flags the cpu to stop . 
counter : --- return the number of operations cpu did 


-- PROC wordtransferl (VALUE repetition, cpumode, CHAN done, oe 
cae wordtransferl (VALUE repetition,cpumode,CHAN done, opnumber)= 

= GesCripimiom 

mann RKKKAKRAAKAKRA AKA KKKAKAKKAAKAKRK AAKRAAKKARARARARARARARR KARA REKRERKK 
--- It executes sequentially several parallel transfers of integers 
--- using the WORD SLICE procedure to/from transputer root. 

--- It uses the global constants: sizetable einmonozes, 

“"= repeblLElon 

mane KARAKAKKARKAKKAKAKRARKAARARARKAAARAKKARAR ARK RAKARAR KAKA ARARKEKKR 


-- variable declarations 
VAR block.size, 
aay c --- the number of operations done by the cpu. 
Cc Woe 
VAR wburfer0 H maxwordblock.size + i. 
VAR wbufferl maxwordblock.size + 1]: 
se et 
Sy Lee Gael i ee elee eta 
SEQ i= [1 FOR maxwordblock.size] 
Se 


wbugterot i} 
. Wh Wie eral a 


1 
1 


SEQ 1 = [0 FOR nr.of.sizes] 
[ 


sizetable[i] < 4 . 
SKIP--- minimum number of bytes is 4 for integer transfer 
TRUE 


SE 

feiyeiee tae := ee ae 
== Input and Cutpun banding 
==> 2npuUe Crom twoecicmmerks 
ae [1 FOR repetition] 


out ! ‘a' 
ages WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
ug 


== input/ OUEDUGseo mL WOMmmaniiics 
SE = [1 FOR repetition] 


Out [Sezer 
PAR | 
WORD .SLICE.INPUT(in,wbuffer0,1)(block.size/4)) 
eis WORD.SLICE.OUTPUT(out,wbufferl,1,(block.size/4) ) 
S 


== input ftromecieccmemannes 
SEQ j = [1 FOR repetition] 
SEQ 


out ! ‘ay 
WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
SKIP 


-- input/output to three links 
SEQ 3 = [1 FOR repetition] 
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SEQ 
our ! a 
PAR 
WORD.SLICE.INPUT(in,wbuffer0,1,(block.size/¢) ) 
oe POnp chiles OUlIPUL cut, wouttfer! 1, (block.size/4)) 
P 


-- input from four channels 
SEQ E = [1 FOR repetition] 
S 


out!) Wav 
— WORD.SLICE.INPUT(in,wbuffer0O,1, (block.size/¢4) ) 


== OMe pte r Onno liencranne Ls 
SEQ j = [1 FOR repetition] 
SEQ 


Out “at 
ao WORD.SLICE.OUTPUT (out ,wbuffer0,1,(block.size/¢)) 


Bae OUD UEEane Inoue tA parallel 
SEQ j = [1 FOR repetition] 


SEQ 
Cue ft 2a" 
PAR 
WORD.SLICE.INPUT(in,wbuffer0,1,(block.size/4) ) 
WORD.SLICE.OUTPUT(out,wbufferl,1,(block.size/4) ) 
SREP 
SKIP 
Tee 
-- cpumode not = '0' then get the number of computations done. 
Ccpumode <> 'Q' 
SEQ 
done ! ‘a! 
opnumber ? number 
TRUE 
SKIP 


-- main PROC word.slice.transferl 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


wordtransferl (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
({cpumode = '3') OR (cpumode = '4')) 
Pier AR 


wordtransferl (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
cheno de ay EG on Fas dnode = '6')) 
A 


wordtransferl (repetition, cpumode, flag, counter) 
Gpubusyprod (Flag, Counter) 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR | 
wordtransferl (repetition, cpumode, flag, counter) 
st cpubusyprod (flag, counter) 
wordtransferl (repetition, cpumode, flag, counter): 


=e procedure body transferl.b003 
Q 

answer := 'Y' 

MEE answer = 'yY' 


itt ee eeeib 
ii mepumode 
in ? repetition 
Ip 

ch = ‘A! 


163 


mee oue jou ans toe (repetition, cpumode) 
Cc Ss 

, byte Steer go (repetition, cpumode ) 
ch = ! 

int.transferl (repetition, cpumode) 

= 


word.slice.transferl (repetition, cpumode) 


in ? answer : 


wee KKKKKKKKKKKKKRKKKRRKRRKRKKRRRRRRRRRRRRKRRRRRKKRRKRRRRKRKKKRRKRKRKKKRKRKKKKK 


--- END OF CODE EN TRANSEUT SRE ec 
KEKKKKAKKRRRKERRARKRARKKKAKKKAKKAKAKAARKKKAKKKKKKKEKKERERKERERERRRKEKREKERE 
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So wieenorULERZ BOOS-TDS 


J eoeeenoe trahsfer2.b0O3 

-- PROC transfer2.b003 (CHAN in,out) 
eReC emansterz-bO03 (CHAN imjout) = 
cece iol on 


--- This is the outer procedure placed on transputer 2 . It contains | 
Gael olea. Vvarianles and constants, and all procedures that run®in this 
Bemmeaasputer. It recelvyes aq construct type (ch), ¢pu mode (cpumode), 
--- and number of times each communication sequence (repetition), and 
peemca > decencingly one cf the following procedures: 

--- =o eeransLer2Z - 

aa - byte.slice.trangfer2_ 

se soma. GEransferzZ or 


--- ="VWOrGdeslLCe. Lhansters 
mmm KK KKK RRR KKK KKK KKK KKK KKK KKK KK KK KEK KR KKK KKK KK RRR KKK KK KKK RRR EKKXK 


-- Link Definition 
DEF linkOin 
DEF JimkOout 
DEF linklin 
DEF linklout 
DEF Link2in 
DEF link2out 
DEF Jaink3san 
Der Pink soue 


-- constant declarations 
DEF sizetable = TABLE [ 1, 2, 4, 8 


WU eat 
WIN HH OO 


Top oc Onn leone oo, O12 


. 1024, 1280, 2048, 4096, 8192, 10000 ]}: 
DEF nr.ofasizes = 16: --- as counted from above table 
DEF maxblock.size = 10000: --- last from the above table 
DEF maxwordpblock.size = maxblock.size/4: 
-- variable declarations 
VAR ch : e-- choice of the user in type of construct 
VAR answer $ === choice of the user in continue 
VAR cpumode ee eco he cmc mec UacimelEcOinOperaclon concurrently 


VAR repetition: --~ choice of the user in number of times to run 


-- SC PROC cpubusysum (CHAN flagl, counterchan) 

-- PROC cpubusysum (CHAN flagl,counterchan) 

PROC cpubusvsum (CHAN flagl,counterchan)= 

-=mcescri Detar 

mam RRR KARA KKKAAKKAKRAKKAAKRKAKRKAK KKK AAR RR KK AKAAKAA KARA KKKRAKARERKKAKK 
= eheeps sthemcoul working in ea (time sharing) with link 
--- transfers by doing sum operations . It Stops when it receives 
--- a flag by the channel flagl from the transfer procedure that is 
--- being executed concurrently. It Outputs by channel counterchan 


--- the number of operations done. 
See KAKA KARKRKKRKKRKAKKAKRKKRKRKKRKRRRRKKKKKKRKRRRRRRRKKKRRKRKKKRKRKKKKRKRKRE 


Varad, Se 
WOLKiaG, 
counter, 
ch : 


SEO 
counter 0 
working TRUE 
TIME ? a 


WHILE working 
AL 


Tr 
eG Peel 
working := FALSE 
TiMee: b 
SEV 
e :=artb 
counter --= counter + 1 
counterchan ! counter: 
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-- descriptor 
-- code 


-- SC PROC cpubusyprod (CHAN flagl,counterchan) 

-- PROC cpubusyprod (CHAN flagl,counterchan) 

PROC cpubusyprod (CHAN flagl,counterchan)= 

-- description 

ma KR RRKKRKAKAKAKAKRAKKR RK AAR RKKKAKKKRRRKKRRKKKRRKKARKKKRRKKKRKKKKKKK 
--- It keeps the cpu working in parallel(time sharing) with the link 
--- transfers by doing ay multiplications. It stops when receives 
--- a flag by the channel flagl from the transfer procedure, that is 
--- being executed concurrently. It outputs by channel counterchan 


--- the number of operations done. 
KKRKKAKKRKKKRAKRERK RRR AKA RERKRARKR AKER AKRK KK RARAKRKKKKRAAKKAKKREKKK 


-- constants and variable declarations 
DEF number = 100: SaaS aS ae Otc 
VAR af{number + 1] ---- array of integers 


binumber + ae ---- array of integers 

ejnumber + 1], ---- array of integers 

clock. ---- integer -varlable to get time 
working, =--- boolean =co Stop exeaur sam 
counter, ---- integer -number of operations done 


ch : 


SEQ 
-- initialize buffers and variables 
SEQ a [ 1 FOR number ] 
S 


afi ant 
ia Sa 
See 
GCOUnECE 0 
working TRUE 


WHILE working 
ALT 


Pag dee 
working := FALSE 
zane o Clocks 


SEO 1 = [1 FOR number] 
e[i] := a[i] * b[i] | 
counter := counter + number ---updates nr. of operations 
counterchan ! counter: 
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ee ee 


-- PROC inout.transfer2 (VALUE repetition, cpumode) 

PROC inout.transfer2 (VALUE repetition, cpumode)= 

== GeScCript ion 

Mn REKRKRKKARKAKKAKAKAKAKRARKAKRKAKAAKAAKAAKAKAKAKKRARAR AAR AKAKARAARAKKAKA 
--- Initializes the buffers and executes the procedures iotransfer, 
=== plus, When applicable according to cpumode, one of the following: 
“== sees eo oce or cpubusy.sum. 


-- Uses global constant maxblock.size 
mann RAKKKARKARRARKKKKKAKK RAK RKKRKARKARAAKAKKKAKKKKRARKAARARAAARARRARKR 


CHAN flag, =—-—  ElagsmeneuctuncOnsear 
counter : --- return the number of operations cpu did 


~~ PROC iotransfer2 (VALUE repetition, cpumode, CHAN done, counter) 
PROC lotransfer2 (VALUE repetition, cpumode,CHAN done, opnumber)= 


-- variable declarations 
VAR block.size, 
number, --- the number of cperations done by the cpu. 


Ghia: 
VAR buffer0O [BYTE maxblock.size + 1]: 
VAR bufferl {BYTE maxblock.size + 1 


SEQ 
== Staetlalize butters . 
see = [1 FOR maxblock.size] 


‘buf fer0-BYTE ae i\8 
bufferl-BYTE 1- ive 
Sree 


“he ‘OPFOR ar .of. sizes] 
E 
Dlock.size := Seeger 
See lnpUeadna OuEpUE nandalang 
Seu once enaee enanne ls 
SEQ at {l FOR repetition] 
= 


oud 


out! * a! 
SHO ea Pee nOR ep oek, 5126 | 
eae in ? bufferO[BYTE k] 


S2eOuULpUL, EOmenmee Channels 
eee = [1 FOR repetition] 
EQ 


COUuc asc. 
SEQ k = [1 FOR block.size] 
PAR 
In fepuererOuBVTE k 
out ! bufferl[BYTE k] 
SKIP . 


-- input from four channels 
SEQ Z {l FOR repetition] 
5 


OU, 6 aa: 
SEQ k = [1 FOR block.size] 
in ? bufferO([BYTE k] 
SKIP 


== OURDUE COM fOUul memannels 
SEQ a [1 FOR repetition] 
> 


out ! ‘a! 


SEQ k = [1 FOR block.size] 
out ! bufferO[BYTE k] 
SG 


sa cl GUEDUET dnd Tnewiein pakallel 
SEQ aan [1 FOR repetition] 
S 
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OU G Sear 
SEQ k = [1 FOR block.size] 


R 

in ? bufferO[BYTE k 

out ! bwé&ferwspBYTE k] 
SKEP 


2 
E 
-- cpumode NOT = '0' then get the number of computations done. 
cpumode <> '0' 
SEQ 
done ! '‘'a' 
opnumber ? number 


TRUE 
Ski 


-- main inout.transfer2 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


iotransfer2 reper ey cpumode, flag, counter) 
cpubusysum (flag, counter) 
((cpumode = '3') OR (cpumode = '4')) 
PRI PAR i. 
lotransfer2 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
at cpumnode = '5') OR (cpumode = '6')) 
PA 


lotransfer2 peepee cpumode, flag, counter) 
cpubusyprod (flag, Sanne 2) 
((cpumode = '7') OR (cpumode = '8')) 
PRA PAR 


iotransfer2 yee cpumode, flag, counter) 
Epubusyvpred (Lilac ecmnices) 
TRUE — 
lotransfer2 (repetition, cpumode, flag, counter): 


168 


== PROC byte.slice.transfer2 (VALUE repetition, cpumode) 

PROC byte.slice.transfer2 (VALUE repetition,cpumode)= 

SeaGese)r 1p taon 

ma KEKKARAKARARKKKRAAKKAKKKAKARKAKARARARAKARARARARAAKARKERAERARARAAKEKKR KA 
eee ceuee swt nemouTrers and executes the procedures iotransfer, 
--- plus, when applicable according to cpumode, one of the following: 
--- cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
mana RRKKARKAKERKARAKAAKAARKAKRAARAAAARARRKAARKARARRKARKAKARKAAKKKAKAKKARKKA 


CHAN flag, =—— Vag Selec vem letOusitop . 
counter : --- return the number of operations cpu did 


-- PROC transfer2 (VALUE repetition, cpumode, CHAN done, counter) 
PROC transfer2 (VALUE repetition,cpumode,CHAN done, opnumber)= 


-- variable declarations 

VAR block.size, . 
EAE --- the number of operations done by the cpu. 
Cc : 

VAR bufferO Tae maxblock.size + 1]: 

VAR SUSrenn | BYTE maxblockesazest. 1}: 


SEQ 
Sapihiitoeize butters 
see = {1 FOR maxblock.size] 
ufferO-BYTE i- := i1\8 
bufferl-BYTE i- := i\8 


ine, 
ae = [0 FOR nr.of.sizes] 
S 


lock.size := sizetable[i] 
=a NpUGeeGOMecinee Channels 
SEQ 3 = [1 FOR repetition] 
SEO ath 
a 


CUE. 
oe BYViLE  ShICE INPUT (in, butter0,1,block.size) 
KI 


--SUneUr/OULOUEmGO titee channels 
SEO j = [1 FOR repetition] 
SEO 
out | ea 
PAR 
Ev leaoblenymimunainnourrerO,1,block.size) 
Brute CblGhreUlPUl( OuGsounferl,1,block.size) 
Sine 


== Inu EhOnesour chianne.s 
ED = [1 FOR repetition] 


ene . 
Bebe -SUICE-INPUL(in, putferO,1,block.size) 
Srole 


== OUBSUG EOm tour scinanne ls 
oe = [1 FOR repetition] 


Out) a: 
BY TEC oPMeEn Ou nP UM Oley olnterd,1,block.size) 
Sele 


-- all output and input in parallel 
ay PISPOR repetition} 


out > fa! 
PAR 
EVUE ollebethrun@in, burterO,1,block.size) 
—— BYTE.SLICE.OUTPUT(out ,bufferl,1,block.size) 


169 


SRGE 


Te 
-- cpumode NOT = '0' then get the number of computations done. 
cpumode <> '0' 
SEO 
Cone == a. 
opnumber ? number 
TRUE 
SKIP 


-- main byte.slice.transfer2 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


transfer2 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter) 

((geumode = '3') OR (cpumode = '4')) 
transfer2 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 

\ (cBamods = 75!) OR ven cnane = '6' 
R 


transfer2 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
((cpumode = '7!') OR (cpumode = '8')) 
PRI PAR = 
transfer2 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE oo 
transfer2 (repetition, cpumode, flag, counter): 
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-- PROC int.transfer2 (VALUE repetition, cpumode) 

PROC int.transfer2 (VALUE repetition,cpumode)= 

Pe Ceseh ior Lon 

mn RAKARKRAKKARAAKAAKKAKAKAAKAKKARARKAAKAAAARA RR AKA AARAAAAKAKAAKAKRARARK AK ARK KR 
--~ Initializes the buffers and executes the procedures intransfer, 
--- pius, when applicable according to cpumode, one of the following: 
ie Epublisy.prod OF Cplibusy.sum. 


-- Uses global constant maxblock.size 
man KKRAAEKRARKKAAKKKKRRA RARER ARK A KARARKA AKA AKA KARARAARAKRAKKRA KAR AK 


CHAN flag, se= Clagssene Coll to step ; 
counter : --- return the number of operations cpu did 


-~ PROC intransfer2 (VALUE repetition, cpumode, CHAN done,...) 

PROC intransfer2 (VALUE repetition,cpumode,CHAN done, opnumber)= 

= description.io 

mam KAKKARARAKAKKKAKKKAKAKKAAAKAAKAAAK KARA AK AAAAAKAKAAAKKAAKAKAKARKRKK 
=== st execticteés sequentially several parallel transfers of integers 
occ Liplc, OlGslEpIaimitlyes LO, from tbranspucer root. 

Pome comene OC lOvualeconstants:  sizetable, nr.of.sizes, 

“77 repetition 

mame RAKKAKAAKAKAKKAKAKKAKKAAKAAKKAKKARKARA AK AKRAAARKAAAKAAAKAARK AAR RAK ARR 


-- variable declarations 
VAR block.size, 
BB --- the number of operations done by the cpu. 
. : 
VAR wbuftferd eae e roe + in 
VAR wbufferl [maxwordblock.size + 1]: 
SIH >. 
Zea Ze our fers 
Ee = {1 FOR maxwordblock.size] 


EQ 

Beets lai 
woufferl[1 

ole 


i 
a 


Ee = (0 FOR nr.of.sizes] 


sizetable[i] < 4 


SKIP ---minimum number of bytes is 4 for integer transfer 


TRUE 


SE 

Enon := sizetablelé 
=— Input andwoutpuce hand lang 
Poe put LeOimunnce channe Ls 
a ae {1 FOR repetition] 


ute 4. 
SEO sie aalen OR maine 
Pee Weltrero) ik 
SKRLE 


[sSInouc/ OULDU ENO thueemlinks 
SEQ = [1 FOR repetition] 
SEQ 


OU eed: 
SEQ k = [1 FOR (block.size/4) ] 
PAR 


in ? wbuffer0O[k 
out ! wbufferl[ 
See 


== input from four channels 
SEQ ae (1 FOR repetition] 
° 


GUEo cara. 
SEQ k = [1 FOR tia 
in ? wbuffer0[k] 
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Sik 


== OUTpPUL CONroure eianiiemrs 
eh = [1 FOR repetition] 


Out ) ae 

SEQ k = [1 FOR (block.size/4)] 
ae out ! wbhufferO[k] 
5 


-- all output and input in parallel 
See j} = [1 FOR repetition] 
EQ 


OuC laa! . 
PE = [1 FOR (block.size/4) ] 
AR 
cee Ree ee all 
[kK] 


out ! wbhuffer 
SKIP 


Sher 
Me 


-- cpumode NOT = '0' then get the number of computations done. 


cpumode <> '0' 
sre 
done ! ‘a' 
opnumber ? number 
TRUE 
Sle 


-- main int.transfer2 


((cpumode = '1') OR (cpumode = !2')) 
PAR 


intransfer2 (repetition, cpumode, flag, counter) 


cpubusysum (flag, counter) 
((cpumode = '3!') OR eetnode = '4')) 
PRI PAR 


intransfer2 Wrepe arene cpumode, flag, counter) 
a 


cpubusysum ( , counter) 
icp ane ge = '5') OR (cpumode = '6')) 


intransfer2 paces (eae cpumode, flag, counter) 


cpubusyprod (flag, counter 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR 


intransfer2 eee ergnereG flag, counter) 


cpubusyprod (flag, counter 


TRUE 
intransfer2 (repetition, cpumode, flag, counter): 
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-- PROC word.slice.transfer2 (VALUE repetition, cpumode) 

PROC word.slice.transfer2 (VALUE repetition, cpumode)= 

me eeSertption 

wane AAKKARARKARAKRKAAKAAAKAKKKAAKKKAKAKAKAAKARKAAAAKA RAR KAKAARKA KARA KARA 
--- Initializes the buffers and executes the procedures wordtransfer, 
Zaaeotuopewien dpplicajle according to cpumode, one of the following: 
--- cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
mame RAAKKRAAKKAKARKKARAKKAAKAKAAAAKK AKA RAKKAAKAAKAKAKK AAA KKARKAKRKKKKRARKAR 


CoA -c lac, “== flags the cpu to stop . 
GOunbeGecw=--—N  Getutn the number Of Operations cpu did 


-- PROC wordtransfer2 (VALUE repetition, cpumode, CHAN done, ae 
PROC wordtransfer2 (VALUE repetition,cpumode,CHAN done, opnumber)= 


-- variable declarations 
VAR block.size, . 
Baa: --- the number of operations done by the cpu. 
C : 
VAR wbuffer0 maxwordblock.size + 1]: 
VAR wbufferl maxwordblock.size + 1]: 
S10) 
peo melabize butters 
SEQ 1 = [1 FOR maxwordblock.size] 
SEQ 
Here eu la 1 
1 


Wouter it 1 
SKIP 


SEO =110 FORenr.of.sizes| 
IF 


sizetable[i] < 4 | 
SKIP ---minimum number of bytes is 4 for integer transfer 
Re 


SE 
blockers := See Peels | 
== ewe sdnc Ouro t samc! tng 
-- input from three channels 
SEQ 3 = [1 FOR repetition] 
SEQ 
UL 


out ! 
WORD .SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
SREP 


-- input/output to three links 
SEOg] —all enor: Fepeti tion | 
SEQ 


Oue 17 'a' 
PAR | | 
- WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
: WORD.SLICE.OUTPUT(out,wbufferl,1,(block.size/4) ) 
Sele 


Sst pU bet seme rOUhecManne l= 
SEQ j3 = [1 FOR repetition] 
SEQ 
GU a de 
- WORD.SLICE.INPUT(in,wbuffer0,1,(block.size/4)) 
SNP 


geeOUTCPUEBLORrouUr Channels 
See = [1 FOR repetition] 
SEQ 


Cut. a’ 
Pe WORD.SLICE.OUTPUT(out,wbuffer0,1,(block.size/4) ) 
SKIP 


Peo owe eticmdaceinput im parallel 
oe [1 FOR repetition] 


Il 


Out «!eer a? 

PAR 
WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4)) 
WORD.SLICE .OUTPUT (out, wbufferl,1, (block.size/4)) 


Sk ie 
See 
aot 
-- cpumode NOT = '0! then get the number of computations done. 
cpumode <> '0! 
SEQ 
done ! ‘'a' 
opnumber ? number 
TRUE 
Sie 


-- main word.slice.transfer2 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


wordtransfer2 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter) 
((cpumode = '3') OR (cpumode = '4')) 
FRI PAR 


wordtransfer2 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter) 
((cpumode = '5') OR (cpumode = '6')) 
PAR 


wordtransfer2 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
((cpumode = '7!') OR (cpumode = '8')) 
PRI PAR - 
wordtransfer2 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE — 
wordtransfer2 (repetition, cpumode, flag, counter): 


-- procedure body transfer2.b003 
SEQ 
answer := 'Y' 
WHILE answer = 'Y' 
SEQ 
Ia? sen 
in ? cpumode™ 
shige 2 iqfsjefaigal ig skenel 
in 
Chas > 
BRC etn s Lge (repetition, cpumode) 
Ee = 
mee eco CE Ais So ae (repetition, cpumode) 
Cc = 
1 oe (repetition, cpumode ) 
ch = 'W 
word.slice.transfer2 (repetition, cpumode) 
TRUE 
SKIP 
In? answer 


— m= KKKKKKKKKKKRKRKKKKKKKKKRKKRKKRKKKRRKKRRARKKRRKKKRKKKKRKRKRRRRKRKKKRKRKKKRKKKK 


=== END OF CODE IN] TRANSEUTER ez 
mam RKKKAKKKKKKKAKAKAKKAKAKKKKKKAKKARKKKAKKAKKRKKKAKAKAKRAKKKKKKRARKRKKKKK 
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7 StrhalliorUlLERS BOOS.TDS 


Soot Oe transfer3.b003 . 

-- PROC transfer3.b003 (CHAN in,out) 

PROC transfer3.b003 (CHAN in,out) = 

=——Gescription 

mae KARA AKAAKAAKKKAKKKKAKAKKRARKAKKKKKK KKK AKER AKKKKAARKAKAKKAKKKAKKKKKKK 
[oes Isethe Outer procedWae placed On transputer 3 . It contains — 
--- global variabies and constants, and all procedures that run in this 
--- transputer. It receives a construct type (ch), cpu mode (cpumode), 
--- and number of times each communication sequence (repetition), and 
weomeceus aecordingly one of the following procedures: 

--- - lo.transfer3, 

==> - byte.slice.transfer3, 

ea = 2hieaeranster3 or 


--- - word.slice.transfer3 
mm KRKKKKRKAAKAKK KAKA KAK RRR RE RA RA RA KR REREREREREKRRERERERERERERERERERREKKRE 


-- Link Definitions 

DEF linkOin = 

DEF linkOout 

DEF linklin 

Dees Linkroutr 

Der wlank2in 

DEP Link Zout 

DEF link3in 

DEF Link3out : 

-- constant declarations 

Der eetzeree ee — Pheu oo, 16,052, 64, 128, 256, 512, 
1024, 1280, 2048, 4096, 8192, 10000]; 

DEP inreot.slzes = 16; --- as counted from above table 

DEF maxblock.size = 10000: --- last from the above table 

DEF maxwordblock.size = maxblock.size/4: 


-- variable declarations 


Hou We tk uw ue at 
WAIN OH OO. 


VAR ch : --- choice of the user in type of construct 

VAR answer : --- choice of the user in continue 

VAR cpumode : --- choice of the user in cpu operation concurrently 
VAR repetition: --- choice of the user in number of times to run 


-- SC PROC cpubusysum (CHAN flagl, counterchan) 

-- PROC cpubusysum (CHAN flagl,counterchan) 

PROC cpubusysum (CHAN flagl,counterchan)= 

Se ceschipeten.cyu 

mann RAKKKRKEAAKAAAKAKKAKAKARKKARKKKARKR ARK AARKKRKRARRKKRAKRKKRRRKERAKKKR 
--- It keeps the cpu working in parallel (time sharing) with link 
=-- transfers by doing sum operations . It Stops when 1t recelves. 
--- a flag by the channel flagl from the transfer procedure that 1s 
--- being executed concurrently. It Outputs by channel counterchan 


--- the number of operations done. 
mae RKKAAKAKARAKKAKKKAKAK RAR AARRAKRRKRKARKAKRKKARAKARKAKAKAKRKRRKRRERRERKRK 


wee KKAKKKKKARKKRKKRAKKKKRKKKRKKRRKRRRRRRKRKRRKRKKRKKRKRKRKRRKKKRKKRRRRRKRRRRRRRRK 


VAR 4,D,e, 
working, 
COUunEer, 
Gli « 


SEQ 
counter 0 
working TRUE 
TOME 7 a 
WHILE working 
ie 
duce ee ay (es 
working := FALSE 
TIME ? b 
550 
e :=artb 
counter := counter + l 
counterchan ! counter: 


is 


=="Gescripeor 
-- code 


-- SC PROC cpubusyprod (CHAN flagl,counterchan) 
-- PROC cpubusyprod (CHAN flagl,counterchan) 
PROC cpubusyprod (CHAN flagl,counterchan)= 


os escription 
mame  KKKKKRKKKAKKKRKAKKKRKRKRKRKRKKRKRRKKKKRKKRRRKRKKARKARKRKKRKRKKRKRRKKKARARRKRKKKRKARARKKRK 


--- It keeps the cpu working in parallel(time sharing) with the Wink 
--- transfers Py doing array multiplications. It stops when receives 
--- a flag by the channel flagl from the @inanster phocecure, sulla aac 
--- being executed concurrently. It outputs by channel counterchan 


--- the number of operations done. 
RAKAKKAAKKAAAKKRKARARKAAKAKAAKKRKAKARAKAAKRKAAKKKAKRARAAKRAKKRKKKAKKKKR 


-- constants and variable declarations 
DEF number = 100: =-==="size of array 


VAR a[number + : ---- array of integers 
b{number + |. ---- array of integers 
e(mumber + || ,. Soooeagbay Obelmecde ts 
cloel, s=-- integer -Variable "oO gauge. 
working, ==== boolean -tO Stop execution 


counter, ---- integer -number of operations done 
che 


SEQ 
== initialize buffers and variables 
oe i = [ 1 FOR number |] 

EQ 
a 


He := 3%1 
Dili ees=" Sa 
SKIP 

COU er 0 
working := TRUE 


WHILE working 
ALT 
Plagiegw en 


working := FALSE 
ae ? Cloek 


SEQ 1 = [1 FOR number] 
e{i] := ali] * b{i] 
counter := counter + number ---updates nr. of operations 
counterchan ! counter: 
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-- PROC inout.transfer3 (VALUE repetition,cpumode) 

PROC inout.transfer3 (VALUE repetition, cpumode)= 

Jae Serpe lon 

mann RARAKRARARARKAKAARKAAKAAKKARRKRARKRKRARARKAKARARARARARARARARARARARAR RAR 
--- Initializes the buffers and executes the procedures iotransfer, 
--- plus, when applicable according to cpumode, one of the following: 
lle cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
mma KAAKKRKAAAARARAKAARKAKARKARK RAR AKARRAARKAKAARAAARAARAAKARAAAKARARKARARKKAR 


CHAN flag, === flags the cou teestop . 
SOumeel =: —-- “PerUrn the mumber Of Operations cpu did 


-- PROC iotransfer3 (VALUE repetition, cpumode, CHAN done,...) 

EHS lotransfer3 (VALUE repetition,cpumode,CHAN done, opnumber)= 

=-" Geese Eupeion 

RR AA RA Rae RRA RR RAI 
--= Jt executes sequentially several parallel transfers of bytes 
=== Using, the input/output primitayes to/from transputer root. 

a ec eme  ODemmeemordhiisr Sizetagie, nr.Of.sizes, 


= ieee t LET 
man RERAAKAKAAKKAARKA AKA AKAKRKAAAKKAKRKARKRKARRARARAKRKAAKRARARRARRKARKRAARAAR A 


-- variable declarations 

VAR block.size, 
number, --- the number of operations done by the cpu. 
chia): 

VAR bufferO [BYTE maxblock.size + 1]: 

VAR bufferl |BYTE maxblock.size + l 


SEQ 
-- initialize buffers 
SEQ 1 = [1 FOR maxblock.size] 


oe. . . 
iene EOS eaghh 1= i\8 

bufferl-BYTE i- 1.6 
SRL 


See = [0 FOR nr.of.sizes] 
5 


leek.size := peeet eb relay, 
Se Spee BEC BOUr eehanne ts 
SEQ 3 = [1 FOR repetition] 


SLi!) a 
SEQ k = {1 FOR block.size] 
ee in 2 burfereyByvte k] 


-- output to four channels 
SEQ 3 = [1 ‘FOR repetition] 
SEQ 


ee ey 
SEO 8 =: | PeROR billoek.size] 
Gute! buttered EYTE. Kk} 
Skier 


-- all output and input in parallel 
SEQ a0 [1 FOR repetition] 
S 


Out aeed. . 
SEQ k = [1 FOR block.size] 
PAR 


in ? bufferO([BYTE k 
out ! bufferl[{BYTE k] 
Se 


SKIP 
ie 


-- cpumode not='0' then get the number of computations done. 
cpumode <> '0Q' 
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SEQ 
done ! ‘a! 
opnumber ? number 


TRUE 
Sie 


-- main inout.transfer3 
Te 
((coumode = '1!) OR (cpumode = '2')) 
be 
lotransfer3 (repetition, cpumode, 
cpubusysum (flag, counter 
((cpumode = '3!') OR (cpumode = '4')) 
PRI PAR el 
iotransfer3 (reper rarer cpumode, 
cpubusysum (flag, counter) 
((cpumode = '5!') OR (cpumode = '6')) 
PAR 


iotransfer3 eee cpumode, 
cpubusyprod (flag, counter 
((cpumode = '7!') OR (cpumode = '8')) 
PRI PAR _ 
iotransfer3 ee ee cpumode, 
ane cpubusyprod (flag, Scena 
R 


flag, 


flag? 


flag, 


placer 


counter) 


counter) 


counter) 


counter) 


iotransfer3 (repetition, cpumode, flag, counter): 
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-- PROC byte.slice.transfer3 (VALUE repetition, cpumode) 

PROC byte.slice.transfer3 (VALUE repetition, cpumode )= 

-SeCe cau oO 

man RAAAKKKKAKKAAKKAKRAKAKARKARRKKAKARAKRAKKA KKK RAAKKAKAKRKAKKKKAAARKRRKKARK 
-=- Initializes the buffers and executes the procedures transfer, 
--- plus, when applicable according to cpumode, one of the following: 
eles cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
man KKRKARAARKAKAAAKAKKAAK AK AKRAR ARK AR ARK AKA AAR AAR ARK ARKA RAR AARARARRARK A 


GHaNGE lac. == dco meneecous tous top . . 
counter : --- return the number of operations cpu did 


-- PROC transfer3 (VALUE repetition, cpumode, CHAN done, a 
ae transfer3 (VALUE repetition,cpumode,CHAN done, opnumber)= 

eg ER RR RK REKEREREKR 
aa ere eee Mee o seGQUuehira Vee wemcludamalle)! transters Of BYTES 
oo tniceeie by IE om herepmeceClnec mle, arom Cransputer root. 

--- It uses the global constants: sizetable, nr.of.sizes, 


=="repetl eon 
mae KAKEKKAARKAKAKK ARK AKAKAAKKAKAARAARAKK AR AKAKRARARRARARKRAAKRARKAARK ARK 


-- variable declarations 

VAR block.size, . 
AR --- the number of operations done by the cpu. 
& : 

VAR bufferO [BYTE maxblock.size + 1]: 

VAR bufferl [BYTE maxblock.size + l 


oe 


Tele a ize burtens . 
SEQ 1 = {1 FOR maxblock.size] 


Si 
tee, Oey be i= := 1\S 
bufferl-BYTE i- := i\8 

Sour 


SEQ 1 = [0 FOR nr.of.sizes] 
SE 


EGek.size := eeeec eras sy 

SEU c fron Loum ehannels 

SEQ 3 = [1 FOR repetition] 
SEQ 


Gules sae 
Boles eo Cre EU mirpeoueeet-O,1,block.size) 
Shorr 


2 OUCDUEme OnoUlercnanie /s 
SEQ j = [{1.FOR repetition] 
SEO 


ouc ! “ba! 
Bvtbece nee OUutrunlouresminrerd,1 block.size) 
SKIP 


=> all eCMEpUtsondeanpuc an parallel 
et [1 FOR repetition] 


out. 1 Sax 
PAR | | 
BYaib emer ivi outterO,1,block.size) 
ar BYTE.SLICE.OUTPUT (out, bufferl,1,block.size) 
Shr 


Serr 
ce 
-- cpumode not='0' then get the number of computations done. 
eeumode <> '(!! 
SEQ 
done ! ‘a! 
opnumber ? number 
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TRUE 
Sree 


-- main byte.slice.transfer3 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


transfer3 (reps cae cpumode, flag, counter) 
cpubusysum (flag, counter 
((cpumode = '3!') OR Coun = '4')) 
PRI PAR 


transfer3 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
((cpumode = '5') OR (cpumode = '6')) 
PAR 


transfer3 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR | 
transfer3 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE _— 
transfer3 (repetition, cpumode, flag, counter): 
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-- PROC int.transfer3 (VALUE repetition, cpumode) 

PROC int.transfer3 (VALUE repetition,cpumode)= 

-- description 

mann RAKKKAAKRKAKKKAKKAAKRKARAKAAKKAAAAKKK RARER AKA AKAKAAKAARARARARRAR KAKA 
Pome uciaieces the burrers and executes the procedures intransfer, 
--- plus, when applicable according to cpumode, one of the following: 
--- cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
manne KEKKARKARAAKKAKAKKKAAKAKAKARAKARAAAR AKA AAKARKRAAKAKAAAKAAKKAAKKAKK A 


CHAN flag, a2 f fags eee ce merous Cop . . 
GQuUnCEr 7 ===" Peruri the mumbel of Operations cpu did 


-- PROC intransfer3 (VALUE repetition, cpumode, CHAN done, a 
PROC intransfer3 (VALUE repetition,cpumode,CHAN done, opnumber)= 
== “CeSGhiptien 


wee KAAKAKKKKKKKKRARKKKKKKARARKKKRAKKKKRARKRKRKRRRARKRKRRARRRRKRKRKRKRRRKRRKRKKRRAKKE 


--- It executes sequentially several parallel transfers of integers 
--- using the input/output primitives to/from transputer root. 

Soe lees e ome. G_Ola mmaois eatats: ssl Zecqple, Nr.Or,Ssizes, 

ee eee avon 

mane RAKKARKARKAAAKKKAAAAAKAKKARARAARKKAR AK AARARAAKAAARKAAAKAAAAKRARRAKAK 


= VaAGioo le Ceclarations 
VAR block.size, 
aa. --- the number of operations done by the cpu. 
VAR wbuffer0 [maxwordblock.size + 1]: 
VER wbufferl [maxwordblock.size + 1]: 


SEQ 
aa athttialize buffers 
SEQ 1 = [1 FOR maxwordblock.size] 
SEQ 
Pateeoe ile H i 
i 


woufferl|BYTE 1 
SKIP 


oEe = {0 FOR nr.of.sizes| 


Sizetable[i] < 4 
SKIP ---minimum number of bytes is 4 for integer transfer 
TRUE 


SE 
Mock aise := SE aS cencs ea 
== input from four channels 
SEQ 3 = [1 FOR repetition] 
SEQ 
Que! ade 
SEQ k = [1 FOR Siamese 
in ? wbuffer0[k] 
SAGE 


=> OUEDUIE  tOmFOouln schanie is 
SEP j = {1 FOR repetition] 
EQ 


Ollie val 
SEQ k = [1 FOR (block.size/4) ] 
out ! wbhufferO[k] 
SHEE 


sedi OMeelceanoinpieeim parallel 
SEQ 3 = {1 FOR repetition] 
SE 


Giles! =" a! 
Seon = [1 FOR (block.size/4) ] 


See po eeeene ty 


out ! wbhuffer 
SKIP 


18] 


eye 
If 


-- cpumode not='0' then get the number of computations done. 


cpumode <> 'Q' 
SEQ 
done ! ‘a! 
opnumber ? number 


TRUE 
SHSP 


“= main int.transfer3 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


intransfer3 Eley count cpumode, flag, 
cpubusysum (fl counter) 
Cone =..3 (DOR Feces = '4')) 


 anerare tes flag, count: cpumode, flag, 
cpubusysum (fl counter) 
etepuneds mee Sy) manteainoae meres) 


Oeeeneree Flag, count: cpumode, flag, 
cpubusysum (fl counter 
“A cpunede = ya) cpumode = '6!)) 


antares (renee tio cpumode, flag, 
coubusysum (flag, counter) 


counter) 


counter) 


counter) 


counter) 


intransfer3 (repetition, cpumode, flag, counter) 
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-- PROC word.slice.transfer3 (VALUE repetition, cpumode) 

PROC word.slice.transfer3 (VALUE repetition,cpumode)= 

[> Cesemlp Ero!) 

mann KAKRKAKKARARAAKARARKAAKAKAAKAKKKKARARKKARAKKKERERKRKERARKKKKRAKKKRKKKR 
Seeder Zeoetice) burters and executes the procedures wordtransfer, 
--- plus, when applicable according to cpumode, one of the following: 
am cpubusy.prod or cpubusy.sum. 


-- Uses global constant maxblock.size 
mm RAKKKRAARKAKKAKKKKKAKRK AKA AKA RK KAA KKKAKRKRKKRAKKRKRKRKKKAARAKAKKRKKKKR 


CHAN flag, ——— Welds Edewecoumto stop | 

@0uUnter = === return the number of operations cpu did 
SeueOeewOnderansters (VALUE bepetition, cpumode, CHAN done,...) 
PROC wordtransfer3 (VALUE repetition,cpumode,CHAN done, opnumber)= 
S-eaescriptilon 


mmm KARKRKAKKRKRKKRRRKRRKRKRRKKRKRKRKRKRKRKRKRKRRRKRKRRKKRKRRKRRRKRKRRKRKRRKRKRRKRRRKAKRKRKRRKRRRRRK 


--- It executes sequentially several parallel transfers of integers 
--- using the WORD.SLICE procedures to/from transputer root. 

Seen ocomae GlOovalecenctamcs: Sizetable, nr.of.sizes, 

~-- repetition : : . 

mane RARKRAKKKAKKKKRAKAKKKKKKKKAKAKKKKKKKKKAKAKKKKKRKAAKKKKRKKKKKARERKAKKK 


-- variable declarations 

VAR block.size, 
ae --- the number of operations done by the cpu. 
Cc =|: 

VAR wbufferO [ maxwordblock.size + 1]: 

VaR wbufrerl [ maxwordblock.size + 1]: 


SEQ 
waelnicetalize buffers 
SEQ i = {1 FOR maxwordblock.size] 
Ss 


eae rO(BYTE i] 
wobufferl|IBYTE i 
P 


- 
1 


Seen = (0 FOR nr sof <sizes| 
TF 


sizetable[i] < 4 : 
<cee  eiemu number of bytes is 4 for integer transfer 
R 


SE 
Siec ere := eiecedone ty 
> Input eanamowcput Nandling 
7 >) INPUcEntoimclr Channels 
SEQ 4, = [1 FOR repetition] 
SHOE 


Oue a! 
a WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
ac 


=> OCULDUED tOeEoun cnanne ls 
aE j = [1 FOR repetition] 
EQ 


out. § ‘a! 
eae WORD.SLICE.OUTPUT(out,wbuffer0,1, (block.size/4) ) 


-- all output and input in parallel 
eo = {1 FOR repetition] 


Ou ea” 
PAR | | 
WORD.SLICE.INPUT(in,wbuffer0,1, (block.size/4) ) 
: WORD.SLICE.OUTPUT(out,wbufferl,1, (block.size/4) ) 
SKIP 


SKIP 
ee 
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-- cpumode not='0' then get the number of computations done. 
cpumode <> '0' 
Sao. 
done ! ‘a' 
opnumber ? number 


TRUE 
SKIP 


-- main word.slice.transfer3 
F 
((cpumode = '1') OR (cpumode = '2')) 
PAR 


wordtransfer3 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
((cpumode = '3') OR (cpumode = '4')) 
PRI PAR 


wordtransfer3 (repetition, cpumode, flag, counter) 
cpubusysum (flag, counter 
((cpumode = '5') OR (cpumode = '6')) 
PAR 


wordtransfer3 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
((cpumode = '7') OR (cpumode = '8')) 
PRI PAR 
wordtransfer3 (repetition, cpumode, flag, counter) 
cpubusyprod (flag, counter) 
TRUE i 
wordtransfer3 (repetition, cpumode, flag, counter): 


-- main transfer3.b003 
SEQ 
answer := 'yY'! 
WHILE answer = 'Y'! 
SE 


OU ete. 2 
ie, sepumoges 
Ine ¢ Sepe eae Lom 
ne 
ch = ‘A’ 7 
inout.transfer3 (repetition, cpumode) 
Cc = 
4 DY eee ans eee (repetition, cpumode) 
Cc — 
ee Gua (repetition, cpumode ) 
ch = 'W 
word.slice.transfer3 (repetition, cpumode) 
TRUE 
‘SKIP 
Tee sansWer 
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mam KKAKKKKKARARKAKAKRKRKRKKKKRKKKRKKKRRKRKRKKKRKRKRKKRKRKRKRKKRKRKRKRKKRKRKKRKRKKRKRKRKKAKRKKKRRE 


== Seonraqunac lon 
mann RAKRKRKARAKAAKKAAKKKKAKKAKKKARAARKKKAKKAKKARERAAKRRA RA KARKARRAKKARA RRR 


== "ink Bee a Ons 
DEF linkOin 
DEE Tink0out 
DEF linklin 
DEG Tne lout 
DESL inkZin 
DEG i nhZout 
DEF link3in 
DEF link3out 
DEF root = 100: 

CHAN pipein[4],pipeout[4]: 


Pie Geer ren 
-—-PER@GeooOR ROOT 
PROCESSOR root _ 
PLACE prpean(ol, AT linkOin 
ee prpegut ae Ae aRE 
Epica) inklin 
PLACE pipeout 1 AT linklout 
PEACE pipeln} } AT link2in 
PLACE pipeout Pe eile 20 ite 


otk tk tk Gk tk a 
WIN HNHOOwW 


PLACE pipein[ Al Pinks 

PLACE pipeout[3] Jkayehicsierb te 

hostproc (pipein[0] jpivesalal ein[2], rpeint el. 
pipeout{0],pipeout| 1 in pipeou bl /pipeout [3] ) 


== CROCE Soon 0 

PROCESSOR 2 
PLACE pipein[0 AT linkOout 
PLACE pipeout [0] AT linkOin 


transfer0.b003 (pipeout([0],pipein[0] ) 


“= PROCESSOR 1 

PROCESSOR 1 | 
PLACE ea | AT linkOout 
PLACE pipeout([1] AT linkOin 


transferl.b003 (pipeout[1],pipein[1]) 


Se PROCESSOR -Z 

PROCESSOR 2 
PEACE Eee eel AT linkOout 
PLACE pipeout[Zz] AT linkOin 


transfer2.b003 (pipeout[2],pipein[2] ) 


~-sEROGZo50R 3 

PROGESSOK 3 
PLACE pipein[3 AT linkOout 
PLACE pipeout[3] AT linkOin 


transfer3.b003 (pipeout[3],pipein[3]) 
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APPENDIX F 
PROGRAM TEST LINEARITY 


-- header.occ 
eo KARR KKKKKKKKKKKKKKKKKKRKKKRKKKKR KKK KKK RR KR RR KKK KR KKK KKK KRKKKKKE 


-- x Title : Test Performance Linearity 

-- * Version : 2 

== x Hoda ao . 

-- * Author : Jose Vanni Filho, Ledr., Brazilian Navy 
-- * Date : June, 5th,1987 

== x Programming pangueds OCC AM ed 

a x Conipiler iS eeGCO moe iis 

== * Brief Description =ethas programmdppedaiaees 
ae - transputers shows aus the capabalaty to pmenc 

ee ‘ transputer in linear increase of performance 
-- i with the increase of the number of processors. 


== KARKAEKKKKRKARKRKRKRRRKRKRRARRKRKRRRRKRKRKRRRRRKKRKRKRKRKRKRKRRRRRKRKRRRRRRRKRKRKRRRER 


== Brief Desére otion aa 

--- This program nuns an 17 Erancoumems. 

--- transputer Root - prompts the user and triggers the other 
aa tidnspurers = 

=== - times the whole process execution 

=i = Heceives the nest esedWaesencdseor ene screen 
--- transputers 00,10,20,30 - execute two processes in parallel: 
aon - routes the trigger and the results, 

=== - executes the procedure counter 

--- transputers 01,02,03,11,12,13 20322722751) 37 ee oue 

=<5 - executes the procedure counter only 


-- PROGRAM testlinearityl7 

-- testlinearity 

== SC PROG hostunoc 

=~ PROG snOs toad 

PROC hostproc (CHAN A,B,C,D,E,F,G,H) = 


=< Bees de fimlenonse( pam eacis) 
-- Constants Definitions 


DEE wor c = 0: --- assign the i/o port of the BOO1 to the terminal 
DEF baud = 11:--- set the baud.rate to 9600 bps 

DEF null = 0: --- constantly used ASCII values 

DEF bell = 7: 

DEF tab = 9: 

ODS) Tlie = 10: 

DEF er = 13: 

DEF esc = 27: 

DEF sp = 32: 


-- Channels Definitions 

CHAN Parameters AT ; 

CHAN Screen : --- AT 1: |This placements cannot be done in TDS. The 
CHAN Keyboard: --- AT 2: |terminal.driver already takes care of that 
== Din Definitions 

DEF linkOout = 
DEE einen. 
DEE linkZoue 
DEF link3out 
DEFs inka 
DEF linklin 
DEF link2in 
DEF link3in : 


-- File Handler Control Values 


“HOP WNF © 


Hund uw ued 
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DEF ClosedOK ieee 
DEF CloseFile =) =2 
DEPe ORG Ws teiG Se 
DEF EndFile = -4 
DEF EndName = 5 
DEF EndParameterString = -6 
DEF Enanecord een) 
DEF NextRecord =e 
DEF OpenedOK = -10 
DEF OpénworRead == 
DEF OpenForwWrite = -12 


== Mpaoig) JOCemmn (Dartial) 

-- io_routines.occ (partial) 

=--"SuMtmeany Ofua/0o PROGs 

Po > OP ReCepev ine. 4 Sanewates a seheand a LF 

~=~ PROG@evetee string Outputs a string to theiscreen, byte by byte 
~~ Giledh. semeen Clears the screen and home the cursor 

--- PROC write.number displays an integer value in the screen 


-- PROC new.line 
mm KKKKRERKKRKRKRKRKAERKEKRARERRER RRR RERRRRERRERERERRERERRERRRERERRRKE 


--- Jumps to a new line on the screen 
mane RRKKAKRARAKKARAKRKKRAKK ARK RK ARK ERK ARARR AKA ARKRARRRARARKRARKRRRKREKKR 


PROC new.line = 
SE 


creen ! cr;lf;EndBuffer 


TepRO@Gmt tte string a value string | } 
mane KKAKKARRAAKARKARKKRA RAK RAK AR KARK RAR AKRRK ARR AK KKAKKKKARRRARAA KKK KK 


--- Writes a given string to the screen, in a byte by byte fashion 
mane KARKRKAKKKRRAKKKAKARR EKA RRA RA RARKKKARARKKRRRKRKRKKRRKKRERERKRERKKK 


PROC write.string (VALUE string| ]) = 
mane KKAAKKKKKAKKR AR AKA ARK KKK RR RAR ARAKAKRRRR RRR AKKKKAKKKKRRRRRRK KKK 


SE 
So 1 = {1 FOR string[BYTE 0] ] 
Serecn s strang(BYIE i] 
Screen ! EndBuffer : 


--PROC clear.screen 
mm KR RRR KERRIER ERE REE KKK KK KKK KKK KKK KKKKKKKKKKKKKKKKKKKKKKKKKEKKKKE 


--- Clears the screen. 
mm RK KK KEK AKKKKKKKRARKARAKKAAK ER KEAKKARKEAR RRR KAKKAKRKKAKKERKKEKKAKRKE 


PROC clear.screen = 
SE 


Screen ! esc; '-'; '2'; 'J'; EndBuffer --- clear screen sequence 
SGreemrmesce ae! s VE! os --- home cursor 

-- PROC write.number (VALUE number 

mm KKKKKKKKKKKEKKRKKKAKKKRAKRKRRRKKKKKKKRKKKKKKRKERKKKRKRKKKKKKKK 


--- This PROC outputs a signed integer value to the screen * 
mane RRARKKARKARAAKKAKAKAARRRA RAK RAK RRAAKA KKK RAR AKKRRRARAKREKKAKK 


PROC write.number(VALUE number) = 
Veeeoutput( 16), count, x: 


SEQ 
*X:= number 
eount:= 0 
IF 
re handle special cases 
a 
Screen ! 'Q! 
xe 0 
SEQ 
Screen ! ‘=! 
KsS=7R 
TRUE 
SKIP 
WHILE x>0 
-- construct number 
SEQ 
output(count] := (x 10) + '0! 


187 


COUNE <= coune + 1 
Ks= 3/10 


WHILE count > 6 
-- output number 


SEQ 
cOUNE :="counts! 
Screen ! outputlcoune | 
SIIL2 2 


me KK KKK KKAKKK KKK KKEKKK EK KKERKREKRKKARK RRA KR KKK RE KKKEKKKEK KKK KKEKRKKKE 
=-= Uti lleies.oce 

mm KKREKKERKKREKKKRKKEKREKRKEKEKKEEKKEREEKRERERR RK KRERKRERERREKREKEERER 
-- PROC ttck-to.f2me (VAnmm sect stop board. type) 

mm RK KARE KERR KKK ARR RERRRKRERRRRRERRRRRERRRRRARRREKRKRREREKRRKKE 


--- Recélives start and Stop cime wand board Soe cnameucruEs 
--- the elapsed time in hours, minutes, seconds and milliseconds 


man RKKKAKKRKRAKKRARAKERKRARARERKK ARERR KERKRE RA RAKKKKR KARR RERARRKRAKKKK 
PROC tick.to.time (VALUE start, stop, board.type) = 

-== board. type = 0 —— oe 2 Vils 

--- boandstype 2.1) -——= nou 

==> board.typet= 2 ooo o> wee ae 

--- board.type = 31----> B003 high priomian,.) 

--- board.type = 32----> BO003 IGWepraors tyes) 


eae Ose cols uole 4 ----> B004 


-- constant definitions 


DEF vax.sec =10000000 : ~-- hundreds of nsec/second 

DEF vax.mili = 10000 : --- hundreds of nsec/millisecond 

DEF b0Ol.sec = 625000 : --- # of 1.6 microsec/second 

DEF b0OOl.mili = 625 : --- # of 1.6 microsec/millisecond 
DEF bOO3h.sec = 1000000 =" "+ OL Mlenrosee7 second 

DEF bOO3h.mili = 1000 --- fe OLeleGnosec, Mtn Tce come 

DEF BOOSL.sec =" Se iseaz> --- # of 64 microsec/second 

DEF b0031.mili = 16 : --- # of 64 microsec/millisecond 

DEF max.number.of.ticks = 2147483648 : --- maximum integer (2%*%*31) 


VAR elapsed.tick : 
VAR factorl, factorZ 
VAR MS@C, EOL SEC, SEC iin 


SEQ 
Te 
board.type = 0 == ee VMS 
SEQ 
factorl := vax.sec 
factor2 := vax.mili 
board.type = 1 --- BOOL 
SEQ 
Paeeemi += D0Cl. sees 
factor2 := b0OOl.mili 
board.type = 2 =220 5.00 
SKIP --- will be implemented in the future 
board.type = 31 --- B003 in high priority 
Su 
Factor = b0O3h.sec 
factor2 := b003h.mili 
board.type = 32 --- B003 in low priority 
SEQ 
factorl := b003l.sec 
factor2 := b0031l.milil 
board.type = 4 os = BO04 
SKIP --- will be implemented in the future 


elapsed.tick := stop - start 
De 
elapsed.tick < 0 
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elapsed.tick := 
TRUE 
SAREE 


sec elapsed. elem factor] 
tot.sec/3600 
(fot Beg cao 1/50 


tot.sec 


EOu. 
hr 
min 
sec 
msec 


-- output time to screen 
write.number r 

Wekece .Seringe hr") 
write.number min) 
“elce.Serangai' min") 
write.number(sec) 

Grate. string (' secs!) 
write.number (msec) 
write.string ('' msec") 


ai PROC capitalize (VAR ch 


KAKKKKARKAKRKARKKARKRAKKKK 


elapsed.tick + max.number.of.ticks 


rele ae Giek factor!) /£actorZ 


KRAKAKAKKAKKAAKKKKAKKAKKKKAAKKKAKAKKKKKAKAKKKRKKKAARK 


ee ea any lower case character into upper case 
mame ARKKKKAKKAKKAKKAKKRRKRAKRKKRRKRRKRKRKKKKAKRRKKRKRRKARKRKKRRKRKRKRKKKRKARKKRKRARKRAKKK:E 


PROC capitalize (VAR ch) = 
DEF delta =('a!' - ‘'A') 


one 


en <=me'z ) AND (ch >= 'al) 
ch := ch - delta 


-- link placements 

hostinO AT linkOin: 
mostint At lanklin: 
hostin2 AT link2in: 
hostin3 AT link3in: 
hostoutO AT linkOout: 
hostoutl AT linklout: 
hoesecuce AL JinkZout: 
hostout3 AT link3out: 


-- PROC terminal driver 


A ---> 65 
a---> 97 ASCII values 
Z <---> 122 


KAAKKAKKKARAKKKKAKKKRKKRARKKRKRKKAKAKKKKKKKAKRKKKKRAKKRKRRKRKRKRKKRRAKKKKKKE 


The términal driver is the one provided by the manufacturer 
for the bOOL board and therefore is not included. 
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== PROC Use re tmecmt cee 
mR KKKKAKKKAKRRKERKKRKRKKEKKRKEKKEKKKKEK KKK KAKEAKAKKKKKAKER 
-=- RécéiveBelag some cee use ame See the network 
--- Receive results from the network an output to the screen 
wee we KKK KK KKKRKRKKRKKRKRKKRAEKKKKRKKRKERKRKKKRKKRKKKEKRE 
PROC user.interface = 

-- local constant and variable declaration 


KRRARRKKKKKKKKAKRKK 


DEF tot = 16 : ==S number vont canseuivenm. 

VAR Bel. --- flag from the user 

VAR result{tot] ; 

VAR startimeroot, endtimeroot: --- timers for the root 

VAR startime[toti, endtime[tot]: --- timers for the 16 transputers 
SEQ 


write. Sane Type any character to start ") 
Keyboard ? ch 
Dehee na ch 


new. line 

TIME ? startimeroot 

PAR 
-- send flags 
hostoutO ! ch 
NOstouri. ) echi 
HOS EOE ch 
hes tous =e 


ae Ce 1 VeGmues anes 
SEG. = [0 FOR 4] 
AR 


hostinO ? pesuitl1)| ;stantime ie eme eumodes) 

hostinl ? result/i+4 ;startime i+4 ;endtime[i+4 
hostin2 ? result{it+8];startime[i+8] ;endtime[i+8 
hostin3 ? result{itl i -startime[itl2]; endtime[{1i+12] 


SKIP 


TIME ? endtimeroot 
-- send results to the screen 
SEQ be) = [0 FOR tot] 


write.string pag nee ie) 
write.number (3) 
Serecn se se 
write.number (resultia) 
SCLCChe sop; esp 
tick.to.time (startime[j],endtime[j] ,32) 
new.line 
Si@EP 
-- send total execution time to the screen 
new.line 
write.string (" Time to execute in parallel ") 
WEIEe oS being 2 SeWaleh dag elec sll elas — lee 
tick.to.time (startimeroot,endtimeroot,1 


R 
IMS.BO0O1.terminal.driver(Keyboard,Screen,port,baud) 
user.interface : 


=a ead af code. fen; .< uenenn)  aneeuaeecs AANA 


--- End of code for transputer Roo 
mmm KKRKKRKEKKRKERKRKERKKE er Oe acne sk se NIC ee eae 
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-- SC PROC Route | 

-- PROC Route (CHAN messagein, messageout, routetol,...,VALUE k) 

PROC route(CHAN messagein,messageout, routetol,routeto2,routeto3, 
echofroml,echofrom2,echofrom3,VALUE k)= 


DEF 1= 4: --- number of counter procedures 

VAR msg : Sse ll 

VAR resuits[i] : 

VAR startime[1],endtime[{1]: --- timers 
CHAN softin,sortout: --- soft channels declared for communication 


=== with procedure counter. 
ae eran oc PcOUNnter 
m- FROG counter 
PROC counter (CHAN in,out, VALUE tnumber) = 
== Gescriptlen 


ee) Mies) Un aere:~ tetany OL ie cell 8s 28) 


--- seme: UP ees 100000 integers and add the transputer number 
===/tovtne tota 
wm A KRKKKAKKKKARRAKKKAAK RAK ARK KARA KRKKAKRAKK RAK KKK AAKKAAKRAAKRRKKKKKE 


DEF maxope = 100000: --- number of operations done 
VAR ch, total 


VAR startime3, endtime3: 


SEQ 
total := tnumber 
ieee Cha 
Hie a esegh eimes 
SEQ 1 = [0 FOR maxope] 
Occ: = eeocd lial 
TIME ? endtime3 
out ! total;startime3;endtime3: 


-- descriptor 
== —COde 


SEQ 
PAR | 
counter (softout,softin,k) 
mmenouleincg procedure 
SEQ 
messagein ? msg 
SEQ 
PAR 
POULcL OL: sg 
routetozd ! msg 
routeto3 ! msg 
softout ! msg 
PAR 
echofroml ? results-0O-;startime-0-;endtime-0- 
echofrom2 ? results-l-;startime-l-;endtime-l- 
echofrom3 ? results-2-;startime-2-;endtime-2- 
softin ? results-3-; startime-3-;endtime-3- 
a se otng PGme em UuCOUMre ses dnidselning 
SEQ i = [6 FOR 4] 
messageout ! results[i];startime[i];endtime[i]: 
Rn REKKAKAKAKK KR RKKAKRK KAR AKA KKK AK KAK AKA RA RAK RAKKAKK KKK KKK KKK KRKKKR 
--- End of code for transputers Routers 


Bes Tele cg 


(99 Ore 30) 
mee KKARAAKKRAKKAAAKRAARK ARK RAKARRARRRAEKR RAR RARRRARAAARRAKKRARKRERAKRERARKKR 


Loe 


== SC) rROG counter 

-- PROC counter (CHAN in,out, VALUE tnumber) 

PROC counter (CHAN in,out, VALUE tnumber) = 

“= G@SseripEton . 

nm KKK AKKKAKRKAKKAKAAAKKAKKAAAKKKKARAAKAKKAAAKAKKKARAAKKAAAAAAKAKAAKK 
--= Sums up the first LOCCOe ee eae and add the transputer number 


--- to the total. Sends the resul through channel out. 
me KK KKAKKAKKRKKAKERKRKKRKRRAKRR RR RKR AERA RRA KRERERERRRRRKKKRERKKKKKEKKE 


DEF maxope = 100000: 
VEReon peoradar 
VAR startime, endtime: 


SE 
otal := tnumber 
in cl 
TIME ? startime 
SEQ 1 = [0 FOR maxope] 
EO ad: = eo ral miaa 
TIME ? endtime 
out ! total;startime;endtime: 


mee KKKKKKKKKKKKRKKKKKAKKKRKKRKKRKRKKRKKKKKKKKAKRKKKRKKKKKRKKKKKKKKKKKRKKAKKKAKRKA 


=== ENG Of scocde meor transputers Leaves (Q1 O2 03 11,12) eA Zie cee. ) 
mm RKKKKKKAKKAKKKAKKARARKKREKAAREKRRRRKRRRARARRERR AAR RRR RRR ERR RERRERKK 


mmm KKKKKAKKAKKKKKKKKAKAKKKKKAKRKRAKAKRAKKKKAKKKKRKKKKKKKKKKRKKKKKRKKKKKKKKAKRE 


meow eeu de lol 
mn RARARARKARAKAKAKAKAKKAARKARAKRAARA RA AAKKKAAKKAKARKR AAA AARAAKARKRARARARAK 


-- link definitions 
DEF linkOin = 4 : 
DEF linkOout = 0 
DEPalanklin = 5 
DEF linklout = 
DEF link2in = 6 
DEF link2out = 

EF link3in = 7 
DEF link3o0ut = 
DEE root = 100: 
DEF totlinks = 32: 
CHAN pipe[totlinks]: 
PLACED PAR 


PROCESSOR root 
pein) Placements andepnocess assignment 


ab 
oe 
3 


Boner. Pipe | Oleat elinkOan, = 
PLACE pipe Be PlLeoue | 
PEACE pipe AT linklin 





1 

2 
PEACE papel) Ar Linkiout 
PLACE pipe|4| AT link2in 
BESCE plpewe eal LinkZout 
PACE pipe/6} AT Link3in 
PEACE pipeig | AL Jinksout 


hostproc apt ee bets tpe te te ips (Sl. 
pipe|1lj,pipe|3],pipe sy pipe ZA 
PLACED PAR j = [0 FOR 4] 
PROCESSOR 10*3 | 
=- link placements and Bees assignment 
PLACE pipe(2*)] AT linkOout : 
PLACE pipe| (2*j)+1] AT linkOin : 


PAGE pipe eee AT link2in : 
PLACE pipe|9+(6*3)] AT link2out 
Peper tpe et o-)) | At linklin = 
PE ebepepe  lit(o7))| AT linklout : 
Hiern pape | l2on]) | Abelink3 in 
PLAGE, Pipe LS eoas AT link3out 
route (pipe[(2*j)+1] ,pipe[2*j] ,pipe Qe yoay lL Dipelt ey. | 
eee eae aes Loe 6*j)-,pipe-12+(6*j )-,10%*j) 
PLACED PAR 1 = [0 FOR 4] 
PROCESSOR (10*i)+1 | 
pein Placements sand Process assignment 
Emer Pipe at tents | AT link3out : 
PEGE pipe (94+(672)) AT Link3in 


Gouneer pipe 9+(6~1) |) pipe[6+(671) |, ((10*1)+1)) 


PLACED PAR i = [0 FOR 4] 
PROCESSOR (10*i)+2 } 
-- link placements and process assignment 
PLACE Been AT linklout : 
PLACE pipe{11+(6*i)] AT linklin : 


Coumcer( pipe | li+{6%1) | pipe (10+(6%1)], ((10*1)+2)) 


PLACED PAR i = [0 FOR 4] 
EROGEOOOR (10*1)+3 
mein placements andwprocess assignment 
PLACE Pee te heats | poe Zoe: 
PEee@e pipe | )3+(6%1)} AT FankZin : 


Gouncer (pipe | 134(6*1) |epipe /[12+(6*1)],((10*1)+3)) 
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APPENDIX G 
TRANSPUTER PRODUCTS* 


a. Transputers 
e IMS T414B-GISS - 32 bit transputer - 1Smhz 
e IMS T414B-G20S - 32 bit transputer - 20mhz 
e IMS TS800B-G20S - 32 bit floating point transputer - 20mhz 
@ IMS T212A-G17S - 16 bit transputer - 17mhz 
e IMS 7T212A-G208S - 16 bit transputer - 20mhz 
e IMS M212B-G15S - Winchester and Floppy disk controller 
b. Evaluation Boards 
e IMS B002-2 - T 414 with 2MBytes DRAM with 2 x RS232 
e IMS BO003-1 - Described in Chapter I 
® IMS BO003-2 -4x T 414 - 20mhz each with 256KB DRAM 
e =6IMS B004-4 - Described in Chapter I 
e IMS BOO0S5-1 - M212 with 64kbytes SRAM, 20MB WINIT, 640K Floppy 
e IMS BO06-1 - [212 with 64kbytes SRAM, and 2 x RS 232 
e IMS B006-2 - T212 with 64kbytes SRAM, and 8 x 7212 (8k SRAM) 
e IMS BO07-1 - Graphics Evaluation Board with | T414, 512k DRAM 
c. Digital Signal Processing 
e. IMS A100-G20S - 32 Stage cascadeable signal processor 


* All trademarks on this page are registered trademarks from 
INMOS Group of Companies, Bristol, UK. 
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